scikit

GoldCorpus
class
v2.0 This feature is new and was introduced in spaCy v2.0
An annotated corpus, using the JSON file format.

This class manages annotations for tagging, dependency parsing and NER.

GoldCorpus.__init__
method

Create a GoldCorpus.

NameTypeDescription
trainunicode or Path or iterable Training data, as a path (file or directory) or iterable. If an iterable, each item should be a (text, paragraphs) tuple, where each paragraph is a tuple (sentences, brackets),and each sentence is a tuple (ids, words, tags, heads, ner). See the implementation of gold.read_json_file for further details.
devunicode or Path or iterableDevelopment data, as a path (file or directory) or iterable.
returnsGoldCorpusThe newly constructed object.