spaCy currently supports the following languages and capabilities:
Chinese tokenization requires the Jieba library. Statistical models are coming soon.
Work has started on the following languages. You can help by improving the existing language data and extending the tokenization patterns.