Wajdi Zaghouani

University of Quebec at Montreal (UQAM)

Linguistic Department

Office DS-4406

Email : zaghouani.wajdi AT courrier.uqam.ca

 

 Research Interests 

 Publications 

 University Education 

 Language Skills 

 Links 

 

Research Interests

Named entities extraction, Arabic natural language processing, Lexicon creation, Text mining, electronic dictionnaries, Propbank, Machine translation,Text categorisation, Information retrieval, Computational linguistics .

Recent projects

- Quranic Arabic PropBank

- Arabic Pilot Propbank

-Arabic Treebank annotation

-Arabic Part of speech annotation

-Fairuz : The arabic heritage historical dictionary

-The department of Education AlKitaab

 


University Education

2010-2014

Phd candidate in linguistics, University of Quebec at Montreal, Quebec, Canada  

 

2006-2009

Master of linguistics, University of Montreal, Quebec, Canada  

1999-2002

Bachelor degree in computational linguistics, University of Quebec at Montreal, Quebec, Canada

1996 - 1999

D.U.E.L in French language and literature , University of Kairouan, Raqqada, Tunisia   

 

Publications, conferences and Corpora

 

·        Books and book chapters

 

1) Wajdi Zaghouani. 2011.  Le repérage automatique des entités nommées dans la langue arabe.  Book published by Les Éditions universitaires européennes. 156p. ISBN  9786131565953, Germany.

 

2)  J. VÉRONIS, O. HAMON, C. AYACHE, R. BELMOUHOUB, O. KRAIF, D. LAURENT, T.M.H. NGUYEN, N. SEMMAR, F. STUCK, W. ZAGHOUANI. (2008).  La campagne d'évaluation ARCADE II. In Chaudiron, S. & Choukri, K. (Eds.) L'évaluation des technologies de traitement de la langue (pp 47-69). Paris:  Hermes Science Publications, IC2 Cognition Collection.  ISBN 978-2-7462-1992-2.

·        Journals and conferences publications

 

1)      Wajdi Zaghouani. 2012. RENAR: A Rule-Based Arabic Named Entity Recognition System. ACM Trans. Asian Lang. Inf. Process. 11(1): 2 (2012).

 

2)      Wajdi Zaghouani, Abdelati Hawwari and Mona Diab. 2012. A Pilot PropBank Annotation for Quranic Arabic. In Proceedings of the first workshop on Computational Linguistics for Literature, NAACL-HLT 2012, Montreal, Canada.

 

3)      Mohammed Maamouri, Wajdi Zaghouani, Violetta Cavalli-Sforza, Dave Graff and Mike Ciul. 2012. Developing ARET: An NLP-based Educational Tool Set for Arabic Reading Enhancement. In Proceedings of The 7th Workshop on Innovative Use of NLP for Building Educational Applications, NAACL-HLT 2012, Montreal, Canada.

 

4)      Wajdi Zaghouani. Vers la création d'un corpus annoté sémantiquement pour la langue Arabe.2012. To be presented at the Informatique Cognitive 2012 (IC’2012) conference, Montreal, Canada, 6-7 June 2012.

 

5)      Wajdi Zaghouani. 2012. Étude sur la composition des noms de personnes dans la langue arabe. In proceedings of the 25th Journées de linguistique de Laval. 9-11 March 2011, Laval University , Québec, Canada.

 

6)      Wajdi Zaghouani. 2011. Le développement d'un corpus annoté sémantiquement pour la langue arabe. ACFAS 2011, May 9-13 2011, Sherbrooke, QC, Canada.

 

7)      Wajdi Zaghouani. 2011. RENAR : un système de repérage automatique des entités nommées pour la langue arabe. In Traitement automatique des langues : analyses et applications workshop, ACFAS 2011, May 13 2011, Sherbrooke, QC, Canada.

 

8)      Wajdi Zaghouani , Mona Diab , Aous Mansouri, Sameer Pradhan and Martha Palmer.2010.  The Revised Arabic PropBank. In  proceedings of the  4th  Linguistic  Annotation  workshop  ACL  held in  Uppsala. July 15-16 2010.

 

9)      Eric Atwell, Kais Dukes, Abdul-Baquee Sharaf, Nizar Habash, Bill Louw,Bayan Abu Shawar,Tony McEnery,Wajdi Zaghouani,Mahmoud El-Haj. 2010.Understanding the Quran: a new Grand Challenge for Computer Science and Artificial Intelligence. In Grand Challenges in Computing Research for 2010 and beyond. part of  ACM-BCS Visions of Computer Science conference. 13-16 April 2010, Edinburgh University

 

10)  Wajdi Zaghouani, Ralf Steinberger and Bruno Pouliquen.2010. A resource-light Arabic Named Entity Recognition system . Georgetown University Round Table 2010. Arabic Language and Linguistics, March 12 - 14 2010.

 

11)  Mohamed Maamouri, Ann Bies, Seth Kulick, Wajdi Zaghouani, Dave Graff and Mike Ciul.
2010.From Speech to Trees: Applying Treebank Annotation to Arabic Broadcast News.  In Proceedings of LREC 2010, Valetta, Malta, May 17-23, 2010.

 

12)  Wajdi Zaghouani, Bruno Pouliquen, Mohamed Ebrahim and Ralf Steinberger .2010. Adapting a resource-light highly multilingual Named Entity Recognition system to Arabic. In Proceedings of LREC 2010, Valetta, Malta, May 17-23, 2010.

 

13)  Wajdi Zaghouani. 2009. Le repérage automatique des entités nommées dans la langue arabe : vers la création d'un système à base de règles. Master dissertation, University of Montreal. Under the supervision of Dr. Patrick Drouin and Dr. Richard Kittredge.

 

14)  Mona Diab, Aous Mansouri, Martha Palmer, Olga Babko-Malaya,Wajdi Zaghouani, Ann Bies, Mohammed Maamouri. A Pilot Arabic Propbank; LREC 2008, Marrakech, Morocco, May 28-30, 2008.

 

15)  Bruno Pouliquen, Marco Kimler, Ralf Steinberger,  Camelia Ignat, Tamara Oellinger, Ken Blackler, FlavioFuart, Wajdi Zaghouani, Anna Widiger, Ann-Charlotte Forslund, Clive Best (2006). Geocoding multilingual texts: Recognition, Disambiguation and Visualisation. Proceedings of the 5th International Conference on Language Resources and Evaluation (LREC'2006), pp. 53-58. Genoa, Italy, 24-26 May 2006.

 

16)  Yun-Chuang Chiao, Olivier Kraif, Dominique Laurent, Thi Minh Huyen Nguyen, Nasredine Semmar, François Stuck, Jean Véronis, Wajdi Zaghouani (2006). Evaluation of multilingual text alignment systems: the ARCADE II project. Proceedings of the 5th International Conference on Language Resources and Evaluation (LREC'2006). Genoa, Italy, 24-26 May 2006.

 

17)  Pouliquen Bruno, Ralf Steinberger, Camelia Ignat, Irina Temnikova, Anna Widiger, Wajdi Zaghouani & Jan Žižka (2005). Multilingual person name recognition and transliteration. Journal CORELA - Cognition, Représentation, Langage. Numéros spéciaux, Le traitement lexicographique des noms propres. Available online at: http://edel.univ-poitiers.fr/corela/document.php?id=490. ISSN 1638-5748.

 

18)  Pouliquen Bruno, Ralf Steinberger, Camelia Ignat, Irina Temnikova, Wajdi Zaghouani & Jan Žižka “Detection of person names and their translations in multilingual news”, Colloque Traîtement lexicographique des noms propres, Tours, 24 March 2005.

 

19)  Zaghouani Wajdi. “Cognitive sciences & linguistics” , Teluq-UQAM , 23 September 2003

 

20)  Zaghouani Wajdi. “AUTO-ÉVAL : an automatic text evaluation system “, In proceedings of the CESLA colloquim, Montréal, UQAM, 2002.

 

·         Published Corpora

 

1)      Arabic Treebank Part 1 Version 4.1. LDC Catalog, LDC2010T13. 2010. Mohamed Maamouri, Ann Bies, Seth Kulick, Fatma Gaddeche, Wigdan Mekki, Sondos Krouna, Basma Bouziri, Wajdi Zaghouani. Linguistic Data Consortium.

 

2)      Arabic Treebank: Part 2 v 3.1. LDC2011T09. ISBN 1-58563-590-1. 2011. Mohamed Maamouri, Ann Bies, Seth Kulick, Fatma Gaddeche, Wigdan Mekki, Sondos Krouna, Basma Bouziri, Wajdi Zaghouani.  Linguistic Data Consortium

 

3)      Arabic Arabic Treebank: Part 3 v 3.2. 2010. LDC Catalog No. : LDC2010T08. 2010. Mohamed Maamouri, Ann Bies, Seth Kulick, Sondos Krouna, Fatma Gaddeche, Wajdi Zaghouani. Linguistic Data Consortium

 

4)      Arabic Treebank: Part 8 v 1.1. 2010 .LDC Catalog No. LDC2010E11* . 2010. Mohamed Maamouri, Ann Bies, Seth Kulick, Wajdi Zaghouani.  Linguistic Data Consortium

 

5)      Arabic Treebank: Part 9 v 1.0. 2010. LDC Catalog No. LDC2010E19* . 2010. Mohamed Maamouri, Ann Bies, Seth Kulick, Wajdi Zaghouani. Linguistic Data Consortium

 

6)      Arabic Treebank: Part 10 v 1.0 2010. LDC Catalog No. LDC2010E22*. 2010. Mohamed Maamouri, Ann Bies, Seth Kulick, Fatma Gaddeche, Sondos Krouna, Wajdi Zaghouani. Linguistic Data Consortium

 

7)      Arabic Treebank Part 13 V1.0, CatalogID: LDC2011E18*.  Mohamed Maamouri, Ann Bies, Seth Kulick, Sondos Krouna, Dalila Tabassi, Michael Ciul, Wajdi Zaghouan. Linguistic Data Consortium

 

8)      Arabic Treebank Part 14 V2.0 - CatalogID: LDC2011E52*. Mohamed Maamouri, Ann Bies, Seth Kulick, Sondos Krouna, Dalila Tabassi, Michael Ciul, Wajdi Zaghouan .Linguistic Data Consortium

 

9)      Arabic Treebank Part 15 V2.0 - CatalogID: LDC2012E10*. Mohamed Maamouri, Ann Bies, Seth Kulick, Sondos Krouna, Dalila Tabassi, Michael Ciul, Wajdi Zaghouani. Linguistic Data Consortium

 

10)  Arabic Treebank Part 16 V2.0 - CatalogID: LDC2011E116*. Mohamed Maamouri, Ann Bies, Seth Kulick, Sondos Krouna, Dalila Tabassi, Michael Ciul, Wajdi Zaghouani. Linguistic Data Consortium.

* These corpora are not yet released to the Public by the LDC.

·         Reports and unpublished manuscripts

1)      ARABIC PROPBANK ANNOTATION GUIDELINES. 2006. Olga Babko-Malaya, Aous Mansouri, Wajdi Zaghouani. May 2006.

2)      Zaghouani Wajdi. “Evaluation of arabic-english Machine translation systems”, non published manuscript,2004

3)      Sylvie Guillemin-Lanne and Wajdi Zaghouani. 2008. Arabic Named entity recognition. Technical report, Temis, France.

4)      Wajdi Zaghouani and Sylvie Guillemin-Lanne. 2008. The Arabic LUXID Text Mining Skill Cartridge. Technial report, Temis, France.

 

·         Awards:

1)      Joseph-Armand Bombardier Canada Graduate Scholarships Program Doctoral Scholarships (http://www.sshrc-crsh.gc.ca/results-resultats/2011/cgs_docs_2011.pdf)

2)      ATALA student travel grant to participate in the TALN 2010 conference in Montreal.

 


 

Language Skills

Arabic Standard:

Native language.

Tunisian Arabic

Native level

Arabic Dialects:

North African, Egyptian, Levantine and Gulf Arabic : fair level

English:

Good level

French:

Near Native level

Italian:

Good level


Work Experience

     2012- Today : University of Quebec in Montreal

     2006- 2012 : University of Pennsylvania - Linguistic Data Consortium
     2005-2006 : Joint Research center (European Comission)
     2002-2004:  Nstein Technologies (now OpenText)

 

Cities I have visited

http://www.tripadvisor.com/CommunityMapImage?id=26603220&type=TRIPADVISOR&size=LARGE

Links

Arabic NLP resources 1

Arabic NLP resources 2

Arabic online Dictionnary

Arabic NLP discussion forum

Online free computational linguistics classes

Useful general linguistics terms

Arabic names of cities in Tunis

Arabic-english translation online

SFST - Stuttgart Finite State Transducer Tools

Arabic Wordnet

LMF for wordnet

KACST computer research institute

4th global wordnet proceedings

list of arabic roots

University of texas foreign language teaching online class

Arabic language rules (in Arabic)

Islamic Knowledge QA

The International Arab Journal of Information Technology (IAJIT)

The Rali : Applied Research lab in Computational Linguistics at the University of Montreal

The department of Linguistics at the University of Montreal

The Observatoire de linguistique Sens-Texte (OLST) research group

The department of Linguistics at the UQAM

Arabic Grammar classes and references

grammar

http://www.angelfire.com/tx4/lisan/arabic201.htm

semantics

http://www.angelfire.com/tx4/lisan/lex_zam/dilalahessays/mujamdilalah.htm

book1

book2

Refernces on semantic role labelling

http://www.cairn.info/revue-langages-2009-4.htm

     http://www.erudit.org/revue/rql/1991/v20/n1/?resume=1