Named entity linking on Wikidata in spaCy via OpenTapioca

A spaCy wrapper of OpenTapioca for named entity linking on Wikidata


import spacy

nlp = spacy.blank('en')
nlp.add_pipe('opentapioca')
doc = nlp('Christian Drosten works in Germany.')
for span in doc.ents:
    print((span.text, span.kb_id_, span.label_, span._.description, span._.score))
# ('Christian Drosten', 'Q1079331', 'PERSON', 'German virologist and university teacher', 3.6533377082098895)
# ('Germany', 'Q183', 'LOC', 'sovereign state in Central Europe', 2.1099332471902863)
# Check also span._.types, span._.aliases, span._.rank
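
Once entities are linked, a common next step is to turn each Wikidata ID into a browsable URL and drop low-scoring links. The following is a minimal sketch of that post-processing in plain Python. It does not call OpenTapioca: it works on tuples shaped like the ones printed above. The names `LinkedEntity`, `wikidata_url` and `filter_by_score` are hypothetical helpers, not part of the wrapper, and the threshold of 3.0 is an arbitrary value chosen for illustration.

```python
from typing import List, NamedTuple

# Wikidata serves each entity page under /wiki/<QID>.
WIKIDATA_ENTITY_URL = "https://www.wikidata.org/wiki/{qid}"


class LinkedEntity(NamedTuple):
    """Hypothetical container mirroring the tuple printed in the example above."""
    text: str
    kb_id: str
    label: str
    description: str
    score: float


def wikidata_url(kb_id: str) -> str:
    """Build the Wikidata page URL for an entity ID such as 'Q183'."""
    return WIKIDATA_ENTITY_URL.format(qid=kb_id)


def filter_by_score(entities: List[LinkedEntity], min_score: float) -> List[LinkedEntity]:
    """Keep only entities whose linking score is at least min_score."""
    return [entity for entity in entities if entity.score >= min_score]


# Values copied from the example output above.
entities = [
    LinkedEntity("Christian Drosten", "Q1079331", "PERSON",
                 "German virologist and university teacher", 3.6533377082098895),
    LinkedEntity("Germany", "Q183", "LOC",
                 "sovereign state in Central Europe", 2.1099332471902863),
]

# The 3.0 cut-off is an assumption for illustration only.
for entity in filter_by_score(entities, min_score=3.0):
    print(entity.text, wikidata_url(entity.kb_id))
# Christian Drosten https://www.wikidata.org/wiki/Q1079331
```

In a real pipeline, the same fields could be read from `span.text`, `span.kb_id_`, `span.label_`, `span._.description` and `span._.score` inside the loop over `doc.ents`. A suitable score threshold depends on the data and would need to be tuned.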
Author info

Renat Shigapov


Categories: models, pipeline

Submit your project

If you have a project that you want the spaCy community to make use of, you can suggest it by submitting a pull request to the spaCy website repository. The Universe database is open-source and collected in a simple JSON file. For more details on the formats and available fields, see the documentation. Looking for inspiration for your own spaCy plugin or extension? Check out the project idea label on the issue tracker.
