Ontology highlight
ABSTRACT:
SUBMITTER: Jaffe A
PROVIDER: S-EPMC8425479 | biostudies-literature | 2020 Dec
REPOSITORIES: biostudies-literature
Jaffe Ariel A Kluger Yuval Y Lindenbaum Ofir O Patsenker Jonathan J Peterfreund Erez E Steinerberger Stefan S
Frontiers in applied mathematics and statistics 20201203
Word2vec introduced by Mikolov et al. is a word embedding method that is widely used in natural language processing. Despite its success and frequent use, a strong theoretical justification is still lacking. The main contribution of our paper is to propose a rigorous analysis of the highly nonlinear functional of word2vec. Our results suggest that word2vec may be primarily driven by an underlying spectral method. This insight may open the door to obtaining provable guarantees for word2vec. We su ...[more]