quran | Scholar reviewed | Unmaintained
Quranic Arabic Corpus
Annotated linguistic dataset from the University of Leeds that provides grammatical, morphological, and syntactic annotation for every word in the Quran. Each word carries part-of-speech tags, roots, and lemmas, and the dataset also includes syntactic treebank data. It is widely treated as a reference resource for Quranic NLP.
- License: GPL-3.0
- Format: csv
- Languages: ar
- Free: Yes
- Auth required: No
- Maintained: No
- Attribution: University of Leeds
- Added: 2026-06-14
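
As a rough illustration of working with the part-of-speech tags, roots, and lemmas described above, here is a minimal Python sketch. It rests on assumptions that this entry does not confirm: the column names (`location`, `form`, `tag`, `features`), the comma delimiter, the `|`-separated `KEY:value` feature syntax, and the two sample rows are all hypothetical placeholders. Check them against the file you actually download.

```python
import csv
import io

# Hypothetical sample rows for illustration only; the real file's column
# names, delimiter, and feature syntax may differ.
SAMPLE = (
    "location,form,tag,features\n"
    "(1:1:1:1),bi,P,PREFIX|bi+\n"
    "(1:1:1:2),somi,N,STEM|POS:N|LEM:{som|ROOT:smw|M|GEN\n"
)


def parse_features(features):
    """Split a '|'-separated feature string into KEY:value pairs and bare flags."""
    parsed = {"flags": []}
    for item in features.split("|"):
        # Split on the first ':' only, so values may themselves contain ':'.
        key, sep, value = item.partition(":")
        if sep:
            parsed[key] = value
        else:
            parsed["flags"].append(item)
    return parsed


def read_segments(text):
    """Read CSV text and replace each row's feature string with a parsed dict."""
    rows = []
    for row in csv.DictReader(io.StringIO(text)):
        row["features"] = parse_features(row["features"])
        rows.append(row)
    return rows


segments = read_segments(SAMPLE)
for seg in segments:
    # Rows without a ROOT feature (such as prefixes) yield None here.
    print(seg["location"], seg["tag"], seg["features"].get("ROOT"))
```

To read a downloaded file instead of the inline sample, pass the file's contents to `read_segments`, and supply a different `delimiter` to `csv.DictReader` if the file turns out to be tab-separated.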