Islamic Resources Open-source Islamic data & tools registry
← All resources
general

PyArabic

Python library for Arabic text manipulation and processing. Covers diacritic (tashkeel) handling, tokenization, normalization, transliteration, and text comparison. Essential preprocessing tool for Islamic text analysis and Quran/hadith NLP pipelines.

License
GPL-2.0
Format
pip
Languages
ar
Free
Yes
Auth required
No
Maintained
Yes
Attribution
Taha Zerrouki
Added
2026-06-14
arabicnlptashkeeltokenizationnormalizationpythonpreprocessing