Ontology highlight
ABSTRACT:
SUBMITTER: Drozdova A
PROVIDER: S-EPMC10280557 | biostudies-literature | 2023
REPOSITORIES: biostudies-literature
Drozdova Anastasia A Trofimova Ekaterina E Guseva Polina P Scherbakova Anna A Ustyuzhanin Andrey A
PeerJ. Computer science 20230223
The use of program code as a data source is increasingly expanding among data scientists. The purpose of the usage varies from the semantic classification of code to the automatic generation of programs. However, the machine learning model application is somewhat limited without annotating the code snippets. To address the lack of annotated datasets, we present the Code4ML <i>corpus</i>. It contains code snippets, task summaries, competitions, and dataset descriptions publicly available from Kag ...[more]