
Dataset Information


BD2K Training Coordinating Center's ERuDIte: the Educational Resource Discovery Index for Data Science.


ABSTRACT: Data science is a field that has developed to enable efficient integration and analysis of increasingly large data sets in many domains. In particular, big data in genetics, neuroimaging, mobile health, and other subfields of biomedical science promises new insights but also poses challenges. To address these challenges, the National Institutes of Health launched the Big Data to Knowledge (BD2K) initiative, including a Training Coordinating Center (TCC) tasked with developing a resource for personalized data science training for biomedical researchers. The BD2K TCC web portal is powered by ERuDIte, the Educational Resource Discovery Index, which collects training resources for data science, including online courses, videos of tutorials and research talks, textbooks, and other web-based materials. While the availability of so many potential learning resources is exciting, the resources are highly heterogeneous in quality, difficulty, format, and topic, making the field intimidating to enter and difficult to navigate. Moreover, data science is rapidly evolving, so there is a constant influx of new materials and concepts. We leverage data science techniques to build ERuDIte itself, using data extraction, data integration, machine learning, information retrieval, and natural language processing to automatically collect, integrate, describe, and organize existing online resources for learning data science.

SUBMITTER: Ambite JL 

PROVIDER: S-EPMC9089329 | biostudies-literature | 2021 Jan-Mar

REPOSITORIES: biostudies-literature


Publications

BD2K Training Coordinating Center's ERuDIte: the Educational Resource Discovery Index for Data Science.

Ambite JL, Fierro L, Gordon J, Burns GA, Geigl F, Lerman K, Van Horn JD

IEEE Transactions on Emerging Topics in Computing, 2019-03-06, 1



Similar Datasets

| S-EPMC6249084 | biostudies-literature
| S-EPMC6044344 | biostudies-literature
| S-EPMC5641239 | biostudies-literature
| S-EPMC11851324 | biostudies-literature
| S-EPMC10432862 | biostudies-literature
| S-EPMC8634500 | biostudies-literature
| S-EPMC7242145 | biostudies-literature
| S-EPMC8449358 | biostudies-literature
| S-EPMC7968699 | biostudies-literature
| S-EPMC5892369 | biostudies-literature