Rethinking the Data-driven Discovery of Rare Phenomena. This project will investigate novel technologies for the data-driven discovery of rare phenomena. Scientific disciplines are increasingly able to generate large amounts of data relevant to key discoveries such as novel photovoltaic materials or explanations of brain seizures. However, these discoveries typically correspond to extremely rare phenomena in high dimensional spaces, which current data science methods are unable to detect. The pr ....Rethinking the Data-driven Discovery of Rare Phenomena. This project will investigate novel technologies for the data-driven discovery of rare phenomena. Scientific disciplines are increasingly able to generate large amounts of data relevant to key discoveries such as novel photovoltaic materials or explanations of brain seizures. However, these discoveries typically correspond to extremely rare phenomena in high dimensional spaces, which current data science methods are unable to detect. The project will fill this void and yield novel methods, publications, and open source software for the data-driven discovery or rare phenomena. Thus, it will expand the capabilities of data science, providing better use of the massive data collections accumulating across science, government, and industry.Read moreRead less