wordnet - WordNet 是如何诞生的

标签 wordnet nlp

我想知道WordNet中单词之间的层次关系是如何检索的。

这是手动完成还是通过计算机技术完成。

如果基于计算机技术,它们是什么?

最佳答案

来自常见问题解答:

q.1.2 Where do you get the definitions for WordNet? (short answer) Our lexicographers write them.

Where do you get the definitions for WordNet? (long answer) From the foreword to WordNet: An Electronic Lexical Database, pp. xviii-xix:

People sometimes ask, "Where did you get your words?" We began in 1985 with the words in Kučera and Francis's Standard Corpus of Present-Day Edited English (familiarly known as the Brown Corpus), principally because they provided frequencies for the different parts of speech. We were well launched into that list when Henry Kučera warned us that, although he and Francis owned the Brown Corpus, the syntactic tagging data had been sold to Houghton Mifflin. We therefore dropped our plan to use their frequency counts (in 1988 Richard Beckwith developed a polysemy index that we use instead). We also incorporated all the adjectives pairs that Charles Osgood had used to develop the semantic differential. And since synonyms were critically important to us, we looked words up in various thesauruses: for example, Laurence Urdang's little "Basic Book of Synonyms and Antonyms" (1978), Urdang's revision of Rodale's "The Synonym Finder" (1978), and Robert Chapman's 4th edition of "Roget's International Thesaurus" (1977) -- in such works, one word quickly leads on to others. Late in 1986 we received a list of words compiled by Fred Chang at the Naval Personnel Research and Development Center, which we compared with our own list; we were dismayed to find only 15% overlap.

So Chang's list became input. And in 1993 we obtained the list of 39,143 words that Ralph Grishman and his colleagues at New York University included in their common lexicon, COMLEX; this time we were dismayed that WordNet contained only 74% of the COMLEX words. But that list, too, became input. In short, a variety of sources have contributed; we were not well disciplined in building our vocabulary. The fact is that the English lexicon is very large, and we were lucky that our sponsors were patient with us as we slowly crawled up the mountain.

关于wordnet - WordNet 是如何诞生的,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/7752440/

相关文章:

命名实体的 Python 自然语言处理

python - 如何检测 NLTK Python 中文本的不确定性?

php - 从 php 调用 wordnet(Wordnet 类或 PHP 的 API)

c++ - 根据 POS 标签值更改同义词引理

java - 我想使用 WordNet 查找单词相似度

python - 将单引号替换为双引号并排除某些元素

python - 如何解决程序中同一产品(手机)的两个稍有不同的名称?

android - 在 Android 应用程序中访问 WordNet 字典文件

python - Nltk安装