The Hindi-Urdu Treebank
The Hindi-Urdu Treebank consists of three levels:
the dependency structure, the Prop Bank, and the phrase structure. Currently the dependency structure has
been annotated, Prop Bank annotation is on its way, and
we are working on automatically deriving the phrase structure from the dependency structure.
Rajesh Bhatt; Bhuvana Narasimhan; Martha Palmer; Owen Rambow; Dipti Sharma; Fei Xia, A Multi-Representational and Multi-Layered Treebank for Hindi/Urdu, In the Proceedings of the Third Linguistic Annotation Workshop, held in conjunction with ACL-IJCNLP 2009, Singapore, August, 2009. (http://aclweb.org/anthology-new/W/W09/W09-3036.pdf)
Martha Palmer, Rajesh Bhatt, Bhuvana Narasimhan, Owen Rambow, Dipti Misra Sharma, Fei Xia, Hindi Syntax: Annotating Dependency, Lexical Predicate-Argument Structure, and Phrase Structure, In the Proceedings of the 7th International Conference on Natural Language Processing, ICON-2009, Hyderabad, India, Dec 14-17, 2009 (http://ltrc.iiit.ac.in/icon_archives/ICON2009/Papers/pdf/28.pdf)
Archna Bhatia, Rajesh Bhatt, Bhuvana Narasimhan, Martha Palmer, Owen Rambow, Dipti Misra Sharma, Michael Tepper, Ashwini Vaidya, Fei Xia, Empty Categories in a Hindi Treebank, to appear in the Proceedings of the 7th International Conference on Language Resources and Evaluation (LREC'10), Valletta, Malta, May, 2010.
Jena D. Hwang, Archna Bhatia, Claire Bonial, Aous Mansouri, Ashwini Vaidya, Nianwen Xue, and Martha Palmer. 2010.
"PropBank Annotation of Multilingual Light Verb Constructions."
Proceedings of the Linguistic Annotation Workshop held in conjunction with ACL-2010. Uppsala, Sweden, July 15-16, 2010.
Department of Linguistics
150 Hicks Way
The University of Massachusetts
Amherst, Massachusetts 01003-9274, USA
Office: 224 South College