- 1. Overview
- 2. Etymology
- 3. Cultural Impact
BabelNet
BabelNet is a multilingual lexical‑semantic knowledge graph , ontology and encyclopedic dictionary that was developed at the NLP group of the Sapienza University of Rome under the supervision of Roberto Navigli . [1] [2] It was automatically created by linking Wikipedia to the most popular computational lexicon of the English language , WordNet . The integration was performed using an automatic mapping and by filling in lexical gaps in resource‑poor languages through the use of statistical machine translation . The result is an encyclopedic dictionary that provides concepts and named entities lexicalized in many languages and interconnects them with a large number of semantic relations . Additional lexicalizations and definitions are contributed by linking to freely licensed wordnets, OmegaWiki , the English Wiktionary , Wikidata , FrameNet , VerbNet and other resources. In analogy to WordNet , BabelNet groups words in different languages into sets of synonyms called Babel synsets . For each Babel synset the system supplies short glosses in numerous languages that are harvested from both WordNet and Wikipedia .
Statistics of BabelNet
As of December 2023 [update] , BabelNet (version 5.3) covers 600 languages . It contains almost 23 million synsets and around 1.7 billion word senses (regardless of their language). Each Babel synset contains, on average, two synonyms per language, i.e., two word senses . The semantic network includes all lexico‑semantic relations from WordNet (hypernymy and hyponymy , meronymy and holonymy , antonymy and synonymy , etc.), amounting to roughly 364 000 relation edges, as well as an underspecified relatedness relation extracted from Wikipedia (approximately 1.9 billion edges). [1] Version 5.3 also links around 61 million images to Babel synsets and provides a Lemon RDF encoding of the resource, [3] which is accessible through a SPARQL endpoint . Additionally, 2.67 million synsets are assigned domain labels .
Applications
BabelNet has been demonstrated to empower a variety of multilingual natural language processing tasks. Its richly lexicalized knowledge yields state‑of‑the‑art performance in several areas, including:
- Semantic relatedness , [4] [5]
- Multilingual word‑sense disambiguation and entity linking , most notably through the Babelfy system, [7]
- Game with a purpose initiatives that harness player interaction for linguistic annotation, [8]
Prizes and acknowledgments
The resource has received several distinctions, among them the META prize 2015 for “groundbreaking work in overcoming language barriers through a multilingual lexicalised semantic network and ontology making use of heterogeneous data sources”. The Artificial Intelligence Journal article that introduced BabelNet [1] was awarded the Prominent Paper Award in 2017. [9] Moreover, BabelNet was highlighted in a Time magazine feature [10] that discussed innovative lexical knowledge resources on the Web.
See also
- Babelfy
- EuroWordNet
- Knowledge acquisition
- Linguistic Linked Open Data
- Semantic network
- Semantic relatedness
- Wikidata
- Wiktionary
- Word sense disambiguation
- Word sense induction
- UBY