Mariana

The Mariana source code has been open sourced. Download the tar file Mariana.tgz below. I would love feedback.

Mariana is an auto-classifier algorithm that efficiently optimizes hyperparameters for Support Vector Machines.

Mariana is an algorithm that efficiently optimizes the hyperparameters for Support Vector Machines for regression and classification. It currently uses Simulated Annealing for optimization but can be extended to use a variety of stochastic optimization techniques including, Markov Chain Monte Carlo, Sequential Monte Carlo, and Genetic Algorithms. Mariana can be applied to the text portion of reports, determining the likely categories that each report falls into, and calculating a confidence for each classification. Mariana¹s innovation is it automates the search for the optimum hyperparameters. It does so by randomly selecting a set of hyperparameters. Next it builds a model from the training data and tests the model's performance using the validation set. That performance is compared to previous performances, and if the current set of hyperparameters are better than the previous one, then it records the hyperparameters. This process is repeated until there are no noticeable improvements in performance or at a predefined stopping point.

Data and Resources

Additional Info

Field Value
Maintainer DAWN MCINTOSH
Last Updated March 31, 2025, 20:34 (UTC)
Created March 31, 2025, 20:34 (UTC)
accessLevel public
accrualPeriodicity irregular
bureauCode {026:00}
catalog_@context https://project-open-data.cio.gov/v1.1/schema/catalog.jsonld
catalog_@id https://data.nasa.gov/data.json
catalog_conformsTo https://project-open-data.cio.gov/v1.1/schema
catalog_describedBy https://project-open-data.cio.gov/v1.1/schema/catalog.json
harvest_object_id fea0c7a5-9c78-4f7b-ba5f-bfd74e9180bb
harvest_source_id 61638e72-b36c-4866-9d28-551a3062f158
harvest_source_title DNG Legacy Data
identifier DASHLINK_117
issued 2010-09-10
landingPage https://c3.nasa.gov/dashlink/resources/117/
modified 2020-01-29
programCode {026:029}
publisher Dashlink
resource-type Dataset
source_datajson_identifier true
source_hash f67518f48e80f4c3f00b37ab088c8fd54c1670dfea23183f5ac0572b964d4790
source_schema_version 1.1