Islamic Resources Open-source Islamic data & tools registry
quran Scholar reviewed Unmaintained

Quranic Arabic Corpus

Annotated linguistic dataset from the University of Leeds that records the Arabic grammar, morphology, and syntax of every word in the Quran. Includes part-of-speech tags, roots, lemmas, and syntactic treebank data. Widely regarded as the gold standard for Quranic NLP.
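To make the word-level annotations more concrete, the following is a minimal Python sketch of how such a file might be read. Everything about the layout is an assumption for illustration: the column names (`location`, `form`, `tag`, `features`), the pipe-separated `KEY:value` feature syntax, and the sample rows themselves are hypothetical and not copied from the dataset, so check the actual download before relying on any of it.

```python
import csv
import io
from collections import Counter

# Hypothetical sample: column names, feature syntax, and row values are
# assumptions for illustration, not an excerpt from the real dataset.
SAMPLE = """location,form,tag,features
1:1:1:1,bi,P,PREFIX|bi+
1:1:1:2,somi,N,STEM|POS:N|LEM:som|ROOT:smw|GEN
1:1:2:1,llahi,PN,STEM|POS:PN|LEM:llah|ROOT:Alh|GEN
"""


def parse_features(features):
    """Split a pipe-separated feature string into KEY:value attrs and bare flags."""
    attrs = {}
    flags = []
    for part in features.split("|"):
        if ":" in part:
            key, value = part.split(":", 1)
            attrs[key] = value
        elif part:
            flags.append(part)
    return attrs, flags


def load_rows(text):
    """Read CSV text and attach parsed features to each row."""
    rows = []
    for row in csv.DictReader(io.StringIO(text)):
        attrs, flags = parse_features(row["features"])
        row["attrs"] = attrs
        row["flags"] = flags
        rows.append(row)
    return rows


if __name__ == "__main__":
    rows = load_rows(SAMPLE)
    # Count part-of-speech tags and list the roots that are present.
    print(Counter(row["tag"] for row in rows))
    print([row["attrs"]["ROOT"] for row in rows if "ROOT" in row["attrs"]])
```

Keeping the attributes (`POS`, `LEM`, `ROOT`) in a dictionary and the bare markers in a list makes it straightforward to filter by root or lemma later, whatever the real column layout turns out to be.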

License: GPL-3.0
Format: csv
Languages: ar
Free: Yes
Auth required: No
Maintained: No
Attribution: University of Leeds
Added: 2026-06-14
quran morphology grammar syntax word-by-word nlp arabic dataset