This package wraps the fast and efficient UDPipe language-agnostic NLP pipeline (via its Python bindings), so you can use UDPipe pre-trained models as a spaCy pipeline for 50+ languages out-of-the-box. Inspired by spacy-stanza, this package offers slightly less accurate models that are in turn much faster.
import spacy_udpipe spacy_udpipe.download("en") # download English model text = "Wikipedia is a free online encyclopedia, created and edited by volunteers around the world." nlp = spacy_udpipe.load("en") doc = nlp(text) for token in doc: print(token.text, token.lemma_, token.pos_, token.dep_)
Submit your project
If you have a project that you want the spaCy community to make use of, you can suggest it by submitting a pull request to the spaCy website repository. The Universe database is open-source and collected in a simple JSON file. For more details on the formats and available fields, see the documentation. Looking for inspiration your own spaCy plugin or extension? Check out the
project idea label on the issue tracker.