ABSTRACT:
SUBMITTER: Deng K
PROVIDER: S-EPMC4896694 | biostudies-literature | 2016 May
REPOSITORIES: biostudies-literature
Ke Deng, Peter K. Bol, Kate J. Li, Jun S. Liu
Proceedings of the National Academy of Sciences of the United States of America, 2016-05-16, issue 22
With the growing availability of digitized text data both publicly and privately, there is a great need for effective computational tools to automatically extract information from texts. Because the Chinese language differs most significantly from alphabet-based languages in not specifying word boundaries, most existing Chinese text-mining methods require a prespecified vocabulary and/or a large relevant training corpus, which may not be available in some applications. We introduce an unsupervis ...