A pilot PropBank annotation for quranic Arabic

  • Wajdi Zaghouani
  • , Abdelati Hawwari
  • , Mona Diab

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

The Quran is a significant religious text written in a unique literary style, close to very poetic language in nature. Accordingly it is significantly richer and more complex than the newswire style used in the previously released Arabic PropBank (Zaghouani et al., 2010; Diab et al., 2008). We present preliminary work on the creation of a unique Arabic proposition repository for Quranic Arabic. We annotate the semantic roles for the 50 most frequent verbs in the Quranic Arabic Dependency Treebank (QATB) (Dukes and Buckwalter 2010). The Quranic Arabic PropBank (QAPB) will be a unique new resource of its kind for the Arabic NLP research community as it will allow for interesting insights into the semantic use of classical Arabic, poetic literary Arabic, as well as significant religious texts. Moreover, on a pragmatic level QAPB will add approximately 810 new verbs to the existing Arabic PropBank (APB). In this pilot experiment, we leverage our knowledge and experience from our involvement in the APB project. All the QAPB annotations will be made freely available for research purposes.

Original languageEnglish
Title of host publicationWorkshop on Computational Linguistics for Literature, CLfL 2012 - Co-located with the 2012 Conference of the North American Chapter of the Association for Computational Linguistics
Subtitle of host publicationHuman Language Technologies, NAACL-HLT 2012 - Proceedings
EditorsDavid K. Elson, Anna Kazantseva, Rada Mihalcea, Stan Szpakowicz
PublisherAssociation for Computational Linguistics (ACL)
Pages78-83
Number of pages6
ISBN (Electronic)1937284204, 9781937284206
Publication statusPublished - 2012
Externally publishedYes
Event2012 Workshop on Computational Linguistics for Literature, CLfL 2012 - Montreal, Canada
Duration: 8 Jun 2012 → …

Publication series

NameWorkshop on Computational Linguistics for Literature, CLfL 2012 - Co-located with the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HLT 2012 - Proceedings

Conference

Conference2012 Workshop on Computational Linguistics for Literature, CLfL 2012
Country/TerritoryCanada
CityMontreal
Period8/06/12 → …

Fingerprint

Dive into the research topics of 'A pilot PropBank annotation for quranic Arabic'. Together they form a unique fingerprint.

Cite this