Название | Linked Lexical Knowledge Bases |
---|---|
Автор произведения | Iryna Gurevych |
Жанр | Программы |
Серия | Synthesis Lectures on Human Language Technologies |
Издательство | Программы |
Год выпуска | 0 |
isbn | 9781681731841 |
Lecture #34
Series Editor: Graeme Hirst, University of Toronto
Series ISSN
Print 1947-4040 Electronic 1947-4059
Linked Lexical Knowledge Bases
Foundations and Applications
Iryna Gurevych, Judith Eckle-Kohler, and Michael Matuschek
Technische Universität Darmstadt, Germany
SYNTHESIS LECTURES ON HUMAN LANGUAGE TECHNOLOGIES #34
ABSTRACT
This book conveys the fundamentals of Linked Lexical Knowledge Bases (LLKB) and sheds light on their different aspects from various perspectives, focusing on their construction and use in natural language processing (NLP). It characterizes a wide range of both expert-based and collaboratively constructed lexical knowledge bases. Only basic familiarity with NLP is required and this book has been written for both students and researchers in NLP and related fields who are interested in knowledge-based approaches to language analysis and their applications.
Lexical Knowledge Bases (LKBs) are indispensable in many areas of natural language processing, as they encode human knowledge of language in machine readable form, and as such, they are required as a reference when machines attempt to interpret natural language in accordance with human perception. In recent years, numerous research efforts have led to the insight that to make the best use of available knowledge, the orchestrated exploitation of different LKBs is necessary. This allows us to not only extend the range of covered words and senses, but also gives us the opportunity to obtain a richer knowledge representation when a particular meaning of a word is covered in more than one resource. Examples where such an orchestrated usage of LKBs proved beneficial include word sense disambiguation, semantic role labeling, semantic parsing, and text classification.
This book presents different kinds of automatic, manual, and collaborative linkings between LKBs. A special chapter is devoted to the linking algorithms employing text-based, graph-based, and joint modeling methods. Following this, it presents a set of higher-level NLP tasks and algorithms, effectively utilizing the knowledge in LLKBs. Among them, you will find advanced methods, e.g., distant supervision, or continuous vector space models of knowledge bases (KB), that have become widely used at the time of this book’s writing. Finally, multilingual applications of LLKB’s, such as cross-lingual semantic relatedness and computer-aided translation are discussed, as well as tools and interfaces for exploring LLKBs, followed by conclusions and future research directions.
KEYWORDS
lexical knowledge bases, linked lexical knowledge bases, sense alignment, word sense disambiguation, graph-based methods, text similarity, distant supervision, automatic knowledge base construction, continuous vector space models, multilingual applications
Contents
1.1 Expert-built Lexical Knowledge Bases
1.2 Collaboratively Constructed Knowledge Bases
1.3.1 ISO Lexical Markup Framework
1.3.2 Semantic Web Standards
1.4 Chapter Conclusion
2 Linked Lexical Knowledge Bases
2.1 Combining LKBs for Specific Tasks
2.2 Large-scale LLKBs
2.3 Automatic Linking Involving Wordnets
2.4 Manual and Collaborative Linking
2.5 Chapter Conclusion
3.1 Information Integration
3.1.1 Ontology Matching
3.1.2 Database Schema Matching
3.1.3 Graph Matching
3.2 Evaluation Metrics for WSL
3.3 Gloss Similarity-based WSL
3.3.1 Word Overlap
3.3.2 Vector Representations
3.3.3 Personalized PageRank
3.3.4 Additional Remarks
3.4 Graph Structure-based WSL
3.4.1 Wikipedia Category Alignment
3.4.2 Shortest Paths
3.5 Joint Modeling
3.5.1 Machine Learning Approaches
3.5.2 Unsupervised Approaches
3.6 Chapter Conclusion
4 Fundamental Disambiguation Methods
4.1 Disambiguating Textual Units
4.2 Enhanced Disambiguation Using LLKBs
4.2.1 Approaches
4.2.2 Overview of Work in this Area
4.3 Robust Disambiguation Heuristics
4.4 Sense Clustering
4.4.1 Method
4.4.2 Overview of Work in this Area