Calculating association between technical terms based on co‐occurrences in keyword lists of academic papers
- 7 February 2003
- journal article
- research article
- Published by Wiley in Systems and Computers in Japan
- Vol. 34 (3) , 85-95
- https://doi.org/10.1002/scj.1197
Abstract
In this paper, the authors evaluate a method to calculate association between specialized terms using co‐occurrence information in author keywords from academic papers. When author keywords found in academic paper databases are used, co‐occurrence information in units of separate compound words can easily be obtained. However, because only a few words co‐occur in one paper, the problem of data sparseness arises. Thus, in this paper, the authors take into consideration indirect co‐occurrence relationships when computing relatedness. They create a large‐scale terminology graph which connects sets of keywords for a paper using co‐occurrence relationship links, then define the distance between two sets of terms using the average path length. In addition, the authors evaluate the validity of their proposed method for calculating association by applying the terminology graph created using a real, large‐scale academic paper database to the problem of automatic classification of texts, then compare the results to when direct co‐occurrence and context vectors are used. © 2003 Wiley Periodicals, Inc. Syst Comp Jpn, 34(3): 85–95, 2003; Published online in Wiley InterScience (www.interscience.wiley.com). DOI 10.1002/scj.1197Keywords
This publication has 9 references indexed in Scilit:
- Term-list translation using mono-lingual word co-occurrence vectorsPublished by Association for Computational Linguistics (ACL) ,1998
- Exploring Textual DataPublished by Springer Nature ,1998
- A cooccurrence-based thesaurus and two applications to information retrievalInformation Processing & Management, 1997
- A word-to-word model of translational equivalencePublished by Association for Computational Linguistics (ACL) ,1997
- Co-occurrence vectors from corpora vs. distance vectors from dictionariesPublished by Association for Computational Linguistics (ACL) ,1994
- Explorations in Automatic Thesaurus DiscoveryPublished by Springer Nature ,1994
- Similarity between words computed by spreading activation on an English dictionaryPublished by Association for Computational Linguistics (ACL) ,1993
- Identifying word correspondence in parallel textsPublished by Association for Computational Linguistics (ACL) ,1991
- An approach to the automatic construction of global thesauriInformation Processing & Management, 1990