SIAM 2007 Text Mining Competition dataset
Data and Resources
-
TrainingData.txt.gzGZ
Training Documents
-
TrainCategoryMatrix.csv.gzGZ
Training Document Labels
-
TestData.txt.gzGZ
Test Documents
-
TestTruth.csv.gzGZ
Test Document Labels
-
Contest_Description_and_Rules.pdfPDF
Contest Description and Rules
-
ScoringSoftware.tar.gzGZ
Software to calculate contest scoring metrics
Additional Info
| Field | Value |
|---|---|
| Maintainer | Nikunj Oza |
| Last Updated | April 1, 2025, 00:09 (UTC) |
| Created | April 1, 2025, 00:09 (UTC) |
| accessLevel | public |
| accrualPeriodicity | irregular |
| bureauCode | {026:00} |
| catalog_@context | https://project-open-data.cio.gov/v1.1/schema/catalog.jsonld |
| catalog_@id | https://data.nasa.gov/data.json |
| catalog_conformsTo | https://project-open-data.cio.gov/v1.1/schema |
| catalog_describedBy | https://project-open-data.cio.gov/v1.1/schema/catalog.json |
| harvest_object_id | 05ffbb41-ab47-4e62-9a24-2846d796f69b |
| harvest_source_id | 61638e72-b36c-4866-9d28-551a3062f158 |
| harvest_source_title | DNG Legacy Data |
| identifier | DASHLINK_138 |
| issued | 2010-09-22 |
| landingPage | https://c3.nasa.gov/dashlink/resources/138/ |
| modified | 2020-01-29 |
| programCode | {026:029} |
| publisher | Dashlink |
| resource-type | Dataset |
| source_datajson_identifier | true |
| source_hash | ff3e6b8c08175cc932e888eae5a004df8573782b2c8d56e40ede5cf0a5da6399 |
| source_schema_version | 1.1 |