Ontology highlight
ABSTRACT:
SUBMITTER: Zhan X
PROVIDER: S-EPMC8276012 | biostudies-literature | 2021 Jul
REPOSITORIES: biostudies-literature
Zhan Xianghao X Humbert-Droz Marie M Mukherjee Pritam P Gevaert Olivier O
Patterns (New York, N.Y.) 20210617 7
Free-text clinical notes in electronic health records are more difficult for data mining while the structured diagnostic codes can be missing or erroneous. To improve the quality of diagnostic codes, this work extracts diagnostic codes from free-text notes: five old and new word vectorization methods were used to vectorize Stanford progress notes and predict eight ICD-10 codes of common cardiovascular diseases with logistic regression. The models showed good performance, with TF-IDF as the best ...[more]