Dr. Girish Nath Jha
Associate Professor, Computational Linguistics
Special Centre for Sanskrit Studies
Jawaharlal Nehru University
New Delhi-110067
Phone(O) : 26741308, 26738834
Fax : 26741308
Email  : girishjha@gmail.com , girishjha@jnu.ac.in 
URL : http://sanskrit.jnu.ac.in

Education

  M.A:      1991, Linguistics, JNU, New Delhi

M.Phil:   1993, Linguistics (Computational Morphology), JNU, New Delhi

Ph.D:     1996, Linguistics (Machine Translation), JNU, New Delhi

M.A:       1999, Linguistics (Natural Language Interface),University of Illinois, Urbana-Champaign, USA

Research Interest

Indian languages corpora and standards, Sanskrit and Hindi linguistics, Science & Technology in Sanskrit texts, Computational Lexicography, Machine Translation, e-learning, web based technologies, RDBMS techniques, software design and localization

Employment History

  • Associate Professor, Computational Linguistics,Special Center for Sanskrit Studies, J.N.U (2008 - )
  • Assistant Professor, Computational Linguistics, Special Center for Sanskrit Studies, J.N.U (2002-2008)
  • Software Development Specialist, Human Kinetics, Champaign, Illinois, USA (2001 – 2002)
  • Software Engineer, Collaborative Technologies (renamed dChain Commerce), Champaign, Illinois, USA (1999 – 2001)
  • Research Assistant, Beckman Institute for Advanced Science and Technology, University of Illinois, Urbana-Champaign, USA (1998 - 1999)
  • Research Assistant, Dept. of Psychology, University of Illinois, Urbana-Champaign, USA (Summer, 1997)
  • Teaching Assistant, Dept. of Linguistics, University of Illinois, Urbana-Champaign, USA (1996-1998)

Guest/Visiting Faculty

Adjunct Faculty, Center for Linguistics, School of Language Literature & Culture Studies, Jawaharlal Nehru University

Short-term guest Lectureship at University of Wurzburg, Germany to teach a short course in Computational Linguistics at the Digital Humanities department under a DAAD fellowship during Jan, 2014

Short-term guest Lectureship at Yogyakarta State University, Indonesia during March, 2013 to teach a one week course on Computational Linguistics to the faculty of the University.

Mukesh and Priti Chatter Distinguished Professor of History of Science, University of Massachusetts Dartmouth, USA (2009-2012)

Consultancy

  • EZ Data Intelligence consultancy for research in smart text processors for medical domain, 2013-
  • Microsoft Research, Redmond, USA  for Machine Translation for Indian Languages,  July - August 2012
    Microsoft releases English–Urdu Machine Translation system in partnership with Special Center for Sanskrit Studies, JNU
  • Linguistic Data Consortium (LDC), University of Pennsylvania, Philadelphia, USA for narrow band data for language identification by computers, January - April 2011
  • Center of Indic Studies, University of Massachusetts, Dartmouth, USA, for exploring scientific texts in Sanskrit, 2010-12
  • Microsoft Research India Lab, Bangalore for their POS tagging project for Indic languages,  November 2006-January 2007
  • Microsoft India on their project on Handwriting Recognition for Devanagari, February – June 2006
  • Mahatma Gandhi International Hindi University, Wardha (then New Delhi)on their Database system for dialects of Hindi  project (2002-2003)

Invited Lectures/Talks (110)

  1. "Resource poverty to resource hunting - the case of Indian languages". delivered at a national seminar on "Towards standardizing Khasi for
    Computational Purposes", St Anthony's College, Shillong, Oct 28, 2014
  1. "Languages Technology and Education: India and Indonesia", Talk delivered at Yogyakarta State University, in an Exploratory Seminar in
    JNU-YSU collaboration, Oct 10, 2014
  1. "Preserving and promoting Sanskrit Manuscripts - Digitization and beyond", Talk delivered in "National Workshop on Research Methodology
    in Sanskrit" at Rashtriya Sanskrit Sansthan, Agartala, Sept 20, 2014
  1. "Information Technology innovations and fundamental sciences in Sanskrit - : A multidisciplinary approach to research", Talk delivered in "National Workshop on Research Methodology in Sanskrit" at Rashtriya Sanskrit Sansthan, Agartala, Sept 19, 2014
  1. "Language Technology, corpora and standards for Indian languages", Talk delivered at the Center for Indic Studies, University of Massachusetts, Dartmouth, Aug 29, 2014
  1. "Sanskrit in the school curriculum – looking ahead", DPS R K Puram, August 14, 2014
  1. "Sanskrit in the age of IT – the age of innovation", ARSD College, Delhi University, August 14, 2014
  1. "ICT applications and innovations for Sanskrit - a practical approach", NCERT, New Delhi, Aug 8, 2014
  1. "NLP and Indian Languages", at the Refresher Course in Computer Sciences and Information Technology at Academic Staff College, JNU, July 28, 2014
  1. "Computational Linguistics for Sanskrit: a viable research option", talk delivered at the CPDHE, Delhi University, July 10, 2014
  1. "Knowledge representation in man and machine - philosophy and beyond", Dept of Linguistics, Gujarat University, Ahmadabad, 21 March, 2014
  1. Sanskrit Computational Linguistics: Opportunities and Challenges, talk delivered at Shri Lal Bahadur Rashtriya Sanskrit Vidyapeeth, Delhi, February 25, 2014
  1. Language Technology Resources and challenges in emerging India, Dept of Letters & Philosophy, University of Florence, Italy, Jan 22, 2014
  1. Making most of diversity – Language Technology in India, German Center for Artificial Intelligence (DFKI), Berlin, Germany, Jan 20, 2014
  1. Corpora, Standards and Language Technology – a tale of emergent India, Wurzburg Center for Modern India, University of Wurzburg, Germany, Jan 16, 2014
  1. Language diversity in India - a challenge for resource creation, German Center for Artificial Intelligence (DFKI), Saarbrucken, Germany, Jan 13, 2014
  1. Indian Languages, Unicode and Automatic Transliteration, Institute of Linguistics, Adam Mickiewicz University, Poznan, Poland, Dec 12, 2013
  1. Computer Processing and Natural Language Engineering of Indian Languages: the Case study of Sanskrit - II, Faculty of Mathematics and Computer Science, Adam Mickiewicz University, Poznan, Poland, Dec 11, 2013
  1. Computer Processing and Natural Language Engineering of Indian Languages: the Case study of Sanskrit - I, Faculty of Mathematics and Computer Science, Adam Mickiewicz University, Poznan, Poland, Dec 10, 2013
  1. Language Technology Resource (LTR) development and India, invited talk in UGC Sponsored One Week National Workshop on Multilingual Technologies, Punjabi University, Patiala, Nov 12, 2013
  1. Science And Technology In Ancient Indian Texts (SATIAIT) - the latest report, presented at the Dept of Chemistry, University of Massachusetts, Dartmouth, July 18, 2013
  1. Information Technology for Indian Language and Literature, invited talk at Chandigarh Sahitya Academy, May 25, 2013
  1. Language Technology and Empowerment, presented at the international seminar on "Language and Power: Perspectives, Issues and Impact", April 6-7, 2013, Special Center for Sanskrit Studies, J.N.U.
  1. Computational Linguistics, corpora and standards: the big picture, invited talk delivered at the Nepali Dept, North Bengal University, Siliguri, April 4, 2013
  1. Newer dimensions of Sanskrit Research in the Information Technology Age, invited talk in Shri Lal Bahadur Shastri Rashtriya Sanskrit Vidyapeeha, March 6, 2013
  1. Sanskrit in the Information Technology Age, invited talk at Daulat Ram College, University of Delhi, March 2, 2013
  1. Sanskrit shodh ki nai dishaen, invited talk at Haryana Sanskrit Academy, Bhiwani, Haryana Feb 10, 2013
  1. Technology for Indian Heritage: the case of ancient scripts and texts, invited talk in the National Seminar on "Application of Information Technology for conservation, editing and publication of Manuscripts", Samskriti Foundation, Bangalore, 21-22 Jan 2013
  1. Language Technology Resource (LTR) Developments at JNU,  invited talk at Academic Staff College, JNU, July 31, 2012
  1. Resource creation for Indian languages - an update, invited talk at a workshop on "Reinforcing International Collaboration on Language Resources and Evaluation", joint COCOSDA-FLaReNet workshop, LREC, Istanbul, May 26, 2012
  1. Emerging trends in Language Technology Resources (LTR) in India, delivered at Microsoft Research, Redmond, USA, Apr 9, 2012
  1. Science and Technology initiatives for Indian Languages: Impacting India through US and indigenous expertise, delivered at an International seminar on "Presence of America in India", University of Illinois, Urbana-Champaign, IL, USA, Apr 7, 2012
  1. "Sanskrit e-learning", invited paper in a national seminar on education at Shri Lal Bahadur Rashtriya Sanskrit Vidyapeeth, Delhi, Feb17, 2012
  1. Sanskrit shodh ki naii dishaaen, talk delivered at Shri Lal Bahadur Rashtriya Sanskrit Vidyapeeth, Delhi, Feb10, 2012
  1. Newer areas of R & D in Sanskrit at JNU: a report, DPS Society, Dwarka, Jan18, 2012
  1. Computational Linguistics Corpora and standards: the big picture, distinguished lecture at JNU lecture series, Feb 3, 2012
  1. Computational Linguistics Corpora and standards: the Indian situation, talk delivered at IIT Patna, Jan30, 2012
  1. Enabling Indian languages on computers, Workshop on Indian language computing, Mohini Devi Memorial School, Araria, 13-16 Jan, 2012
  1. "Indian languages corpora Initiative: current status and future diections", inaugural talk at ILCI workshop organized by IIT Mumbai, Jan3-4, 2012
  1. "Promoting Sanskrit through Science and Technology: A report on the latest developments at JNU", 20th International Vedanta Congress, JNU, Dec28-31, 2011
  1. Introduction to programming in Java, , NLP Lecture Series at Kashmir University, sponsored by LDC-IL, CIIL Mysore, Nov8-9, 2011
  1. "Linguistic Theory and Grammar Formalisms", NLP Lecture Series at Kashmir University, sponsored by LDC-IL, CIIL Mysore, Nov8-9, 2011
  1. "Science & Technology in Sanskrit at JNU: A report", Convergence of Science and Spirituality, Nov 5, 2011, organized by ISSKCON, Sri Venkateswara College, South Campus Delhi University)
  1. "Issues in POS tag-set design for Indian languages", Sept19, 2011, Inaugural talk in the "Workshop on POS-Morph: Urdu-Hindi& Maithili", organized by LDC-IL, CIIL Mysore, 19-21 Sept, 2011, CIIL Mysore
  1. Computational Linguistics and Sanskrit, lecture delivered at International Sanskrit conference at Uttaranchal Sanskrit University, Haridwar, 11th Sept, 2011
  1. "Purushartha and Yoga: a no-contradiction approach", invited talk in the "Summer Workshop on Dharmic Traditions", Center for Indic Studies, University of Massachusetts, Dartmouth, July19 & July26, 2011
  1. "Modern Hindu Practices and their evolution", invited talk in the "Summer Workshop on Dharmic Traditions", Center for Indic Studies, University of Massachusetts, Dartmouth, July19, 2011
  1. "What is dharma: following a textual strategy", invited talk in the "Summer Workshop on Dharmic Traditions", Center for Indic Studies, University of Massachusetts, Dartmouth, July18, 2011
  1. "Why Study Sanskrit", talk delivered in "Trindic Corps Youth Camp", Organized by Center of Indic Studies and 3R Foundation, University of Massachusetts, Dartmouth, July1-4, 2011
  1. Computational Linguistics, Talk delivered at Hindu College, Dec 21, 2010
  1. Sanskrit and Computer, lecture delivered at NCERT teachers orientation, Dec 1, 2010
  1. Language Technology Resources for Sanskrit and other Indian Languages at Jawaharlal Nehru University, India, LDC Institute, University of Pennsylvania, July 22, 2010
  1. Multilingual Multimodal resources – creating a leveling field in India, COCOSDA/WRITE/FLaReNet workshop “language Technology Issues for International cooperation”, LREC2010, Malta, 22 May, 2010
  1. Indian Languages Corpora and Application Development at JNU, Prof R N Shrivastava Memorial Lecture, Kendriya Hindi Sansthan, Agra, March 23, 2010
  1. Language Processing and Corpus Linguistics, Prof R N Shrivastava Memorial Lecture, Kendriya Hindi Sansthan, Agra, March 22, 2010
  1. Language Technology & Sanskrit, Dept of Sanskrit, Hansraj College, Delhi University, Feb 18, 2010
  1. Computational Linguistics for Sanskrit, PhD research workshop, Sri Lal Bahadur Shastri Rashtriya Sanskrit Vidyapeeth, New Delhi, Feb 16, 2010
  1. A national effort to develop technologies for multiple national languages the experience of India, FLaReNet (Fostering Language Resource Network), funded by European Commission, Barcelona, Spain, Feb 12, 2010
  1. International Collaboration: Europe and India, FLaReNet (Fostering Language Resource Network), funded by European Commission, Barcelona, Spain, Feb 12, 2010
  1. Why should you consider Sanskrit, Sanskrit Dept, Mothers School, Jan 22, 2010
  1. Hamaare gaon, shiksha aur computer, Kuruom Vidyalaya: Inauguration and Educational Workshop, Jan 14, 2010, Kuruom, Sultanpur, UP
  1. Web based technologies, Web Creations, Lucknow, Jan 13, 2010
  1. Linguistics for Sanskrit, “Workshop on Sanskrit and Computers: Getting equipped to face the new challenges" organized jointly by the Department of Sanskrit Studies, University of Hyderabad, and the Sanskrit Academy, Osmania University, Hyderabad, 20 Dec 2009
  1. Java for Computational Linguistics, “Workshop on Sanskrit and Computers: Getting equipped to face the new challenges" organized jointly by the Department of Sanskrit Studies, University of Hyderabad, and the Sanskrit Academy, Osmania University, Hyderabad, 21-23 Dec 2009
  1. Computational linguistics R&D at JNU, “Workshop on Sanskrit and Computers: Getting equipped to face the new challenges" organized jointly by the Department of Sanskrit Studies, University of Hyderabad, and the Sanskrit Academy, Osmania University, Hyderabad, 23 Dec 2009
  1. Computational Linguistics and TDIL Programme, panel discussion on “Getting less-resourced languages on board - which languages, which resources, which methods, which funding?", 4th International Language and Technology Conference (4th LTC), Poznan, Poland, Nov 6, 2009
  1. Indian Language Corpora Initiative (ILCI), invited talk, 4th International Language and Technology Conference (4th LTC), Poznan, Poland, Poznan, Poland, Nov 6, 2009
  1. Sanskrit and Computer,  Sanskrit Dept, Gargi College, Delhi University, Sept 17,2009
  1. Documentation initiatives for Sanskrit and other Indian languages, International seminar on language documentation, Center for Linguistics, School of Languages Literatures & Culture Studies, JNU, Oct 27, 2009
  1. JNU Text Encoding & Search for Sanskrit (J-TESS), Rashtriya Sanskrit Sansthan, Delhi, Oct 26, 2009
  1. Issues in Sanskrit NLP and e-learning, Kanya Mahavidyalaya, Jallandhar, Aug 7,2009
  1. Sanskrit aur Kampyutar, DPS HRD Center, Dwarka, New Delhi, Apr 15, 2009
  1. Emerging trends in Language Technology and Sanskrit Computing at the Science Festival 2009, organized by School of Life Sciences, J.N.U., Feb 27, 2009
  1. Methods and Issues in Computer Processing of Panini,  Bhandarkar Oriental Research Institute, Pune, Feb22, 2009
  1. Introduction to Language Technology, Center of Indian Languages, SLL&CS, JNU, Jan 23, 2009
  1. Introduction to Natural Language Processing, refresher course for Computer Science, Academic Staff College, JNU, Nov 14, 2008
  1. Demonstration of Sanskrit software developed at JNU (included as part of Sanskrit CD released by Ministry of IT and CDAC Pune, Nov 25), Tirupati Sanskrit Vidyapeeth
  1. Multimedia e-learning for Sanskrit, at DPS, HRD Center, Dwarka, New Delhi, Nov 16, 2008
  1. Issues in Sanskrit NLP and e-learning, Dept of Sanskrit, Pali and Prakrit, MD University Rohtak, Nov 7, 2008
  1. Sanskrit and Computer, Haryana Sanskrit Academy at Bhiwani, 28/9/2008
  1. Plenary speech in the international  workshop titled  Anaphora Resolution II,  organized by Bergen University, Bergen, Norway, Aug29-31, 2008
  1. Lecture series on NLP and Panini, Dept of Vyakarana, Faculty of Arts, Banaras Hindu University (BHU), March 25-26, 2008
  1. Teaching through e-learning, Central Institute of Higher Tibetan Studies, Sarnath, Dec29-30,2007
  1. Multilingualism, localization and language computing, IP Engineering College, Gaziabad, Nov 17, 2007
  1. Invited speech  at First International Symposium on Sanskrit Computational Linguistics, INRIA, at Paris-Rocquencourt, 29 Oct-31 Oct 2007
  1. Sanskrit Computational Linguistics at Sanskrit Shodh Parishad , Delhi University, August 1, 2007
  1. Lecture series on Language Technology at the NLSI summer training, CIIL Mysore, June 18-19, 2007
  1. Computational Linguistics,  Dept of Linguistics, Kashmir University, Srinagar, Feb23, 2007
  1. Lecture-series in the area of Sanskrit Computational Linguistics at the Dept of NLP and Shabdabodha, Rashtriya Sanskrit Vidyapeetham, Tirupati, Andhra Pradesh, Jan25-27, 2007
  1. Computational Phonology and Morphology, PGDLT course at CDAC, Noida, Jan 5, 2007
  1. Language Technology. PGDLT course at CDAC, Noida, Dec 21, 2006
  1. Lecture series on Language Technology , MA course on Language Technology at Mahatma Gandhi International Hindi University, Wardha, 11-15 Dec, 2006
  1. Sanskrit and computer, Haryana Sanskrit Academy at Panchkula, Chandigarh, Nov 26, 2006
  1. Hindi Lexicography and computer, at Central Hindi Institute, Agra, at the workshop on “Hindi dialect dictionary” , Oct 16,2006
  1. Sanskrit and Computer, Golden Jubilee Celebration of PGDAV College, Delhi University (organized by Delhi Sanskrit Academy, Sept 29, 2006
  1. Invited talk at Summit on Internationalization of Web at Hotel Grand Ashoka, Bangalore (24-25 August, 2006) organized by World Wide Web Consortium (W3C) and Manufacturers’ Association of Information Technology (MAIT)
  1. Computational Linguistics: an overview, CDAC Kolkata, July 6, 2006
  1. Linguistic theories and formal grammar models, CDAC Kolkata, July 5, 2006
  1. Paninian Formalism for NLP,  CDAC Kolkata, July 4, 2006
  1. Invited talk at the First National Symposium on Modeling and Shallow Parsing of Indian Languages(MSPIL-06) 31st March - 4th April, 2006 at IIT Bombay
  1. Language and computers, ASC Orientation Course, Rajdhani College, Delhi University,  Dec 27,2005
  1. Invited speech at the National Seminar on Translation Today: state and issues, Deptt. of Linguistics, University of Kerala, Trivandrum, August 23-25, 2005
  1. Machine Translation in the MA course on Translation Technology at Mahatma Gandhi Antarrashtriya Hindi University, Wardha, April11-16, 2005
  1. Computational aspects of Indian grammatical tradition at the 4th course on IGT organized by CIIL,Mysore at NRLC, Patiala, Feb2005
  1. Computational aspects of Panini at the Refresher course on Sanskrit Linguistics, organized by CIIL,Mysore at Deptt of Linguistics , Punjabi University, Patiala and at the NRLC, Patiala, Feb-March 2003
  1. Computational linguistics,  e-learning at the Orientation course on Information technology at Academic Staff College, JNU, August 2004
  1. Computational linguistics, e-learning at the Orientation course on Information technology at Academic Staff College, JNU, December 2004
  1. Macromedia ColdFusion for dynamic web content at UNESCO sponsored refresher course at School of Information Technology, JNU, September, 2003
  1. Lectures on general linguistics,  Kendriya Hindi Sansthan, Delhi: (1995-1996)
  1. Lectures in UGC sponsored computer courses, Shri Lal Bahadur Shastri Rashtriya Sanskrit Vidyapeeth: (1995-1996)

Keynote/Plenary talks

Information Technology and emerging India – a case of languages and literatures, Keynote speech in "13th Melus-Melow Conference", Dept of English, Jammu University, Jammu, 21 Feb, 2014

Language Technology in India: empowering millions, plenary talk in the "6th Language & Technology Conference:Human Language Technologies as a Challenge for Computer Science and Linguistics" , December 8, 2013, Poznan, Poland

Delivered a keynote talk titled Current progress in developing Sanskrit and other linguistic resources, in International WAVES Conference 2012, University of Massachusetts, Dartmouth, MA, USA, July14, 2012

"Issues in POS annotation and limited crowd sourcing - the case of ILCIANN", keynote talk in a national seminar on "POS Annotation for Indian languages: Issues & Perspectives, organized by LDC-IL, CIIL Mysore, Dec 12-13, 2011

Computational Linguistics and Sanskrit, keynote talk delivered at a workshop on "Computational Sanskrit" at SD College Ambala, 11th Nov, 2011

Research papers/articles(109)

  1. "Sanskrit manuscripts digitization and beyond : the current status in India", presented at WAVES-2014, Maharishi University of Management,
    Fairfield, Iowa, USA, Aug 3, 2014
  1. "Issues in mapping of Sanskrit-Hindi verb forms", Nripendra Kumar Pathak, Girish Nath Jha, proceedings of the "2nd Workshop on Indian Language Data Resource and Evaluation" held under LREC at Reykjavik, Iceland, 26-31 May 2014
  1. "Evaluation of Hindi-English MT Systems", Atul Kr Ojha, Akanskha Bansal, Sumedh Hadke, Girish Nath Jha, proceedings of the "2nd Workshop on Indian Language Data Resource and Evaluation" held under LREC at Reykjavik, Iceland, 26-31 May 2014
  1. "Issues in chunking parallel corpora: mapping Hindi-English verb group in ILCI", Esha Banerjee, Akansha Bansal, Girish Nath Jha, proceedings of the "2nd Workshop on Indian Language Data Resource and Evaluation" held under LREC at Reykjavik, Iceland, 26-31 May 2014
  1. "Mapping Indian Languages onto the IMAGACT Visual Ontology of Action", poster by Massimo Moneglia, Susan W. Brown, Aniruddha Kar, Anand Kumar, Atul Kumar Ojha, Heliana Mello, Niharika Mishra, Girish Nath Jha, Bhaskar Ray, Annu Sharma, proceedings of the "2nd Workshop on Indian Language Data Resource and Evaluation" held under LREC at Reykjavik, Iceland, 26-31 May 2014
  1. "Challenges in Indian Language Transliteration: a case of Devanagari, Bangla and Manipuri", poster by K Kabi Khanganba and Girish Nath Jha, proceedings of the "2nd Workshop on Indian Language Data Resource and Evaluation" held under LREC at Reykjavik, Iceland, 26-31 May 2014
  1. "Indian languages on the TypeCraft platform – the case of Hindi and Odia", poster by Girish Nath Jha, Lars Hellan, Dorothee Beermann, Srishti Singh, Pitambar Behera and Esha Banerjee, proceedings of the "2nd Workshop on Indian Language Data Resource and Evaluation" held under LREC at Reykjavik, Iceland, 26-31 May 2014
  1. Semantic relations in Panini's karaka system: challenges for computer processing, Explorative Workshop: by Jawaharlal Nehru University and University of Zurich on "Relationships of Possession: Linguistic, Literary, Philosophical and Legal Perspectives", 17-19 February 2014, JNU, New Delhi
  1. Computer Processing of South Asian Languages - the case of Sanskrit (presented by Sachin Kumar and Diwakar Mishra), in Special session on, "Recent Developments in Sanskrit Computational Linguistics", at the 30th Round Table of South Asian Language Analysis, Department of Sanskrit Studies, School of Humanities, University of Hyderabad, 8th Feb, 2014
  1. A Generic Search for Heritage Texts, Kumar Nripendra Pathak, Girish Nath Jha in Human Language Technologies as a Challenge for Computer Science and Linguistics (proceedings of the 6th LTC), Zygmunt Vetulani & Hans Uszkoreit (ed), pp 504-508, Fundacja, Uniwersytetu im. A. Mickiewicza, Poznan, Poland, 2013
  1. Linking and Referencing Multi-lingual corpora in Indian languages, Esha Banerjee, Shiv Kaushik, Pinkey Nainwani, Akanksha Bansal, Girish Nath Jha in Human Language Technologies as a Challenge for Computer Science and Linguistics (proceedings of the 6th LTC), Zygmunt Vetulani & Hans Uszkoreit (ed), pp 65-68, Fundacja, Uniwersytetu im. A. Mickiewicza, Poznan, Poland, 2013
  1. Corpora creation for Indian Language Technologies – the ILCI project, Akanksha Bansal, Esha Banerjee, Girish Nath Jha in Human Language Technologies as a Challenge for Computer Science and Linguistics (proceedings of the 6th LTC), Zygmunt Vetulani & Hans Uszkoreit (ed), pp 253-257, Fundacja, Uniwersytetu im. A. Mickiewicza, Poznan, Poland, 2013
  1. Syllabification and stress assignment in phonetic Sanskrit text, Diwakar Mishra, Kalika Bali, Girish Nath Jha, poster presentation at International Oriental COCOSDA 2013 conference, KIIT, Gurgaon, India, Nov 27, 2013
  1. Lexical Database System for Accessing India's Heritage Texts (With special reference to Ayurveda), with Baldev Ram Khandoliyan , published in "Lexicography and Dictionaries in the Information Age" (Selected papers from the "8th ASIALEX International Conference", Bali, Indonesia, 20-22 August 2013), edited by Deny A. Kwary, Nur Wulan and Lilla Musyahda, ISBN 978-602-7924-20-8, pp157-165
  1. Annotating Sanskrit corpus – adapting IL-POSTS, Girish Nath Jha, Madhav Gopal, Diwakar Mishra in Human Language Technology: Challenges for Computer Science and Linguistics (revised and selected papers from 4th LTC), pp 371-379, Zygmunt Vetulani (ed), Springer Verlag, Germany, 2009
  1. "Digital libraries, electronic access and search for texts of Vedantic tradition" in the 21st International Congress of Vedanta, University of Massacusetts, Dartmouth, July 11-14, 2013
  1. Anaphora Resolution in Sanskrit: Issues and Challenges, jointly with Madhav Gopal, presented in "Sanskrit syntax and discourse structures" , international seminar organized by University of Paris Diderot, June 13-15, 2013
  1. Text Normalizer for Sanskrit, jointly with Diwakar Mishra and Kalika Bali, in the proceedings of "5th International Sanskrit Computational Linguistics Symposium", Malhar Kulkarni (ed), DK Printworld, Delhi, 2013
  1. Using the ILCI Annotation Tool for POS Annotation: A Case of Hindi, Ritesh Kumar, Shiv Kaushik, Pinkey Nainwani, Esha Banerjee, Sumedh Hadke, Girish Nath Jha, International Journal of Computational Linguistics and Applications (IJCLA), VOL. 3, NO. 2, JUL-DEC 2012, PP. 93–104, guest editor: Yasunari Harada, Editor in-chief: Alexander Gelbukh, Bahri Publications, Delhi, ISSN: 0976-0962.
  1. Arya vs Dravida and linguistc foundations of Indian languages,  International WAVES Conference 2012, University of Massachusetts, Dartmouth, MA, USA, July14, 2012.
  1. Developing Sanskrit Corpora Based on the National Standard: Issues and Challenges, with Madhav Gopal, proceedings of "Workshop on Indian Language Data:Resource and Evaluation", Language Resource Evaluation Conference (LREC), Istanbul, e-publication by European Language Resource Association (ELRA), May, 2012
  1. Challenges in Developing Named Entity Recognition System for Sanskrit, with Sachin Kumar and Sobha Lalitha Devi, proceedings of "Workshop on Indian Language Data:Resource and Evaluation", Language Resource Evaluation Conference (LREC), Istanbul, e-publication by European Language Resource Association (ELRA), May, 2012
  1. Grapheme-to-Phoneme converter for Sanskrit Speech Synthesis, with Diwakar Mishra and Kalika Bali,  proceedings of "Workshop on Indian Language Data:Resource and Evaluation", Language Resource Evaluation Conference (LREC), Istanbul, e-publication by European Language Resource Association (ELRA), May, 2012
  1. Text encoding and search for Āyurvedic texts: an interconnected lexical database,  proceedings of the workshop on “Adaptation of language resources and tools for processing cultural heritage objects”, Language Resource Evaluation Conference (LREC), Istanbul, e-publication by European Language Resource Association (ELRA), May, 2012 
  1. Transfer grammar for Sanskrit-Hindi Machine Translation, co-author with Kumar Nripendra Pathak, World Sanskrit Conference, Delhi, Jan5-10, 2012
  1. Strategies for Anaphora Handling in Panchatantra, co-author with Madhav Gopal, World Sanskrit Conference, Delhi, Jan5-10, 2012
  1. Theory of Anaphora resolution in traditional Sanskrit texts, co-author with Diwakar Mishra, World Sanskrit Conference, Delhi, Jan5-10, 2012
  1. Standard Text encoding scheme and search for Ayurvedic texts, co-author with Archana Tiwari, World Sanskrit Conference, Delhi, Jan5-10, 2012
  1. Strategies for Semantic representation of Karma Karaka (karturipsitatamam karmam), co-author with Manji Bhadra, World Sanskrit Conference, Delhi, Jan5-10, 2012
  1. Knowledgebase for Sanskrit-Hindi machine Translation divergences, co-author with Diwakar Mani, World Sanskrit Conference, Delhi, Jan5-10, 2012
  1. Resolving Anaphors in Sanskrit, co-author with Madhav Gopal, proceedings of 5th LTC, Poznan, Poland, Nov 25-27, 2011
  1. Creating Multilingual Parallel Corpora in Indian Languages, Co-author with Narayan Choudhary, proceedings of 5th LTC, Poznan, Poland, Nov 25-27, 2011
  1. Issues in annotating less resourced languages - the case of Hindi from Indian Languages Corpora Initiative (co-author with Pinkey Nainwani, Esha Banerjee, Shiv Kaushik), proceedings of 5th LTC, Poznan, Poland, Nov 25-27, 2011
  1. A Rule based Method for the Identification of TAM features in a PoS Tagged Corpus, Co-author with Narayan Choudhary, Pramod Pandey, proceedings of 5th LTC, Poznan, Poland, Nov 25-27, 2011
  1. "Origin and Development of Indian Languages and Linguistic Area", in Perspectives on the Origin of Indian Civilizations (Angela Marcantonio, Girish Nath Jha ed.), DK Printworld, Delhi, 2012
  1. "Paninian System of Languages: A Computational Paradigm", Invited paper in the "Vedanta & Science Symposium", Center for Indic Studies, University of Massachusetts, Dartmouth, July6, 2011
  1. Sanskrit verb argument valence: a computational analysis, co-author with Subhash Chandra, proceedings of the O-COCOSDA 2010, Kathmandu
  1. Translating politeness across cultures: case of Hindi and English, co-author with Ritesh Kumar (presented by Ritesh Kumar), procs of 3rd ACM International Conference on Inter-cultural collaboration, Copenhagen Business School, Denmark, 18-20 Aug, 2010
  1. SaHiT - Sanskrit-Hindi Translator, in the 8th International conference of the WAVES, at Trinidad & Tobago, West Indies, August 5, 2010
  1. Text Encoding and Search for Sanskrit at the "19th International Vedanta Congress", organized by Center of Indic Studies, University of Massachusetts, Dartmouth, July 30, 2010
  1. SATIAIT – progress report and  translation of Rasaayana-saara, presented at the "Symposium on Science and Technology in Ancient Indian Text", Center of Indic Studies, University of Massachusetts, Dartmouth, USA, July 26, 2010
  1. The TDIL program and the Indian Language Corpora Initiative (ILCI), Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC'10), published by ELRA, isbn 2-9517408-6-7, pp982-985, Valletta, Malta, May 19-21, 2010
  1. Multilingual Multimodal Resources of the future: challenges and opportunities for India (with Ritu Nidhi), abstract accepted for W3C international conference titled “World Wide Web: Technology, Standards and Internationalization Conference”, May 6-7, 2010, New Delhi
  1. Online Multilingual Amarakosha: the relational lexical database (with R. Chandrashekar, Umesh Kumar Singh, Vibhuti Nath Jha, Satyendra Pandey, Surjit Kumar Singh, Mukesh Mishra), demo paper in the proceedings of the 5th Global Wordnet Conference, IIT Mumbai, Feb1, 2010
  1. Annotating Sanskrit corpus – adapting IL-POSTS (with Madhav Gopal and Diwakar Mishra),  published in the proceedings of the 4th International Language technology Conference, Poznan, Poland, Nov 2009
  1. Discourse Anaphora and Resolution Techniques in Sanskrit, with Sobha L, Diwakar Mishra begin_of_the_skype_highlighting     , accepted as poster paper at the 7th Discourse Anaphora and Ambiguity Resolution Colloquium (DAARC-2009), held in Goa, India, Nov 5-6, 2009
  1. Linguistic pedagogy and e-learning: the case for Sanskrit, co-author with Prachi Sinha & Priti Bhowmik, accepted for “Contemporary Themes and Issues in Language Pedagogy” vol2, Vaishna Narang ed, 2009
  1. Automatic Indexing System for texts of Indian Intellectual Tradition, with Diwakar Mani, Umesh Kumar Singh, Prachi Sinha, presented at 18th International Vedanta Congress, University of Massachusetts, Dartmouth, July16-19,2009 
  1. Sanskrit Analysis System (SAS), with Manji Bhadra, Surjit Kumar Singh, Sachin Kumar, Subash, Muktanand Agrawal, R.Chandrasekhar, Sudhir K Mishra, in Lecture Notes in Computer Science 5402 Springer Berlin / Heidelberg, ISBN 978-3-540-93884-2 (Amba P. Kulkarni, Gérard P. Huet eds) pp116-133, 2009
  1. An algorithm for morpho-phonemic processing of Sanskrit, Sachin Kumar, Diwakar Mishra     , Girish Nath Jha, presented by Sachin Kumar & Diwakar Mishra begin_of_the_skype_highlighting      at 3rd National Conference on Methods and Models in Computing, School of Computer and Systems Sciences, JNU, Dec 9, 2008
  1. Architecture of dyna-content web-applications, Girish Nath Jha, invited paper at “Requirement Workshop on Assessment Needs of Website of Animal Sciences, Dairy Science, Education and Research along with Transfer of Technology under NAIP funded AGROWEB Sub-Project (Component-I)”, National Dairy Research Institute, Karnal, Oct 3, 2008
  1. Sanskrit Anaphors, with  Sobha L, Diwakar Mishra     , Surjit K Singh, Praveen Pralayankar, Johansson, C. (Ed.) Proceedings of the Second Workshop on Anaphora Resolution (2008), ISSN 1736-6305 Vol. 2, Cambridge Scholars Publishing, 2007
  1. Semantic processing in Panini’s karaka system, Girish N Jha, Sudhir K Mishra, ( Lecture Notes in Computer Science series) 5402 Springer Berlin / Heidelberg, ISBN 978-3-642-00154-3 (Gérard P. Huet, Amba P. Kulkarni, Peter Scharf eds) pp 239-252, 2009
  1. Bhaarat men bhasha praudyogiki: ek sarvekshan, Girish N Jha, Diwakar Mani, Diwakar Mishra     , Gaveshana, Kendriya Hindi Sansthana, Agra
  1. A Common Parts-of-Speech Tagset Framework for Indian Languages, Sankaran Baskaran, Kalika Bali, Tanmoy Bhattacharya, Pushpak Bhattacharyya, Monojit Choudhury, Girish Nath Jha, Rajendran S., Saravanan K., Sobha L., KVS Subbarao, LREC 2008 - 6th Language Resources and Evaluation Conference, May 26-June1, 2008, Marrakech, Morocco
  1. Issues in computer processing of languages – a case for Sanskrit, presented at the International Symposium on Brain Cognition & Behavior, jointly organized by Center for Linguistics, SLL&CS, J.N.U. and Utrecht University (UU), Netherlands, March14-15, 2008
  1. Designing a Common POS-Tagset Framework for Indian Languages (co-author with Sankaran Baskaran, Kalika Bali, Tanmoy Bhattacharya, Pushpak Bhattacharyya, Rajendran S, Saravanan K, Sobha L and Subbarao K V), presented (by Kalika Bali) at International Joint Conference on Natural Language Processing (VI workshop on Asian Language Resources), IIIT, Hyderabad, Jan 11, 2008
  1. Morphological  Analyzer  for  Great  Andamanese  Verbs:  Implementing  a Concatenative Template, co-author with Narayan  Choudhary and  Anvita Abbi, Vishwabharat  (April 2007 ­ January 2008 Journal) 113­-118. TDIL, New Delhi,
  1. Survey of e-content in virtual media: the case of Sanskrit, presented in a national seminar on teaching through e-learning, Central Institute of Higher Tibetan Studies, Sarnath, Dec29-30,2007
  1. software localization me bhasha sambandhi samasyaem, in Gaveshana, Kendriya Hindi Sansthan, Agra, vol88,2007
  1. Sanskrit Computational Linguistics, presented at a national seminar on “Relevance of Sanskrit for science and technology” organized by Gandhi Memorial National College, Ambala, Dec 8, 2007
  1. Inflectional Morphology Analyzer for Sanskrit, Sanskrit Computational Linguistics ( Lecture Notes in Computer Science series) 5402 Springer Berlin / Heidelberg, ISBN 978-3-642-00154-3 (Gérard P. Huet, Amba P. Kulkarni, Peter Scharf eds) pp 219-238, 2009
  1. Sanskrit e-learning: Grammar-Lexicon, paper presented at "National Seminar on Teaching of Sanskrit through e-learning" organized by NCERT, Aug30, 2007.
  1. Optimality re-investigation of Hindi stress system, Rainbow of Linguistics (Vol I, 2007, pp 72-78), T. Media Publication, 33, College Row, Kolkata-700009 
  1. Indian Theories of Knowledge compared with Western Theories and Artificial Intelligence, appeared in Creative Forum – journal of literary and creative writings (pp-63-76), Vol. 20, No.1, Jan-July2007, Bahri Publications(2006), New Delhi
  1. Localization (l10n) of web-applications: strategies for Hindi/Sanskrit, presented at the Workshop on Localization of e-Governance Applications: Issues & Standards, February 21-22, 2007, Napoleon II, Hotel Le Meridien, New Delhi, organized by MAIT and MCIT  
  1. Knowledge modeling in India: an Artificial Intelligence perspective, accepted for the Indology Journal, International Indology Conference, Goa, Feb8-10, 2007
  1. A Natural Language Interface for Databases,  procs of National Conference in Methods and Models in Computing [NCM2C 2006], School of Computer & Systems Sciences, J.N.U., Dec 2006, Macmillan India Ltd., Delhi, pp75-91
  1. Proposing a System for Identifying Divergent Sentences in Russian-English Machine Translation, co-author with Ainura Asamidinova,  procs of National Conference in Methods and Models in Computing [NCM2C 2006], School of Computer & Systems Sciences, J.N.U., Dec 2006, Macmillan India Ltd., Delhi, pp161-171
  1. Linguistic and cultural challenges for software localization (l10n): the case of Hindi/Sanskrit, presented at a national seminar on Technology development for content creation and Localization in Indian languages, organized by CSTT, MHRD, Govt. of India and RKGIT, Gaziabad at RKG Institute of Technology, Gaziabad, Dec 1, 2006
  1. Hierarchical vs. flat tagsets, presented at Microsoft Research India, Bangalore, 28th October, 2006
  1. Computer applications for Lexicography,  at Central Hindi Institute, Agra, in the workshop on “Hindi dialect dictionary”, Oct 16,2006
  1. Problems in Hindi Computing,  at Central Hindi Institute, Agra, in the workshop on “Hindi dialect dictionary”, Oct 16,2006
  1. Strategies for metrical analysis of Sanskrit text, co-author with Diwakar Mishra     , procs of National Seminar on Perspectives in Linguistics, Kashmir University, Srinagar, 2006
  1. Issues in Sandhi Processing of Sanskrit co-author with Sachin Kumar, presented by Sachin at  28th All India Conference of Linguists, Department of Linguistics, BHU, 02-04 Nov 2006
  1. Karaka based Sentence validation System,  co-author with S.K. Mishra, presented by S.K.Mishra at  28th All India Conference of Linguists, Department of Linguistics, BHU, 02-04 Nov 2006
  1. Sanskrit language pedagogy: An e-learning approach , co-author with Priti Bhowmik, presented by Girish N Jha at  28th All India Conference of Linguists, Department of Linguistics, BHU, 02-04 Nov 2006
  1. Online indexing system for Ashtadhyayi, co-author with Diwakar Mishra     , presented by Diwakar Mishra      at  28th All India Conference of Linguists, Department of Linguistics, BHU, 02-04 Nov 2006
  1. Verb phrase Analyzer of Great Andamanese , co-author with Narayan Kumar Chaudhary, Anvita Abbi, presented by Narayan at  28th All India Conference of Linguists, Department of Linguistics, BHU, 02-04 Nov 2006
  1. Strategies for Identifying and Processing Derived Nouns in Sanskrit , co-author with Surjit Kumar Singh, presented by Surjit at  28th All India Conference of Linguists, Department of Linguistics, BHU, 02-04 Nov 2006
  1. Translating Russian Impersonal Constructions into English: A Study by MT Errors, co-author with  Ainura Asamidinova, P.K. Pandey, abstract published by the 28th All India Conference of Linguists, Department of Linguistics, BHU, 02-04 Nov 2006
  1. Sanskrit Subanta Recognizer and Analyzer for Machine Translation , co-author with Subash, presented by Subash at  28th All India Conference of Linguists, Department of Linguistics, BHU, 02-04 Nov 2006
  1. Online Indexing of Ādiparva of Mahābhārata , co-author with Diwakar Mani, presented by Diwakar Mani at  28th All India Conference of Linguists, Department of Linguistics, BHU, 02-04 Nov 2006
  1. Regional and Linguistic perspectives on Internationalization of Web: The case of Hindi/Sanskrit, presented at Summit on Internationalization, organized by W3C and MAIT, at hotel Grand Ashoka, Bangalore, August 24-25, 2006
  1. Towards a Computational Analysis System for Sanskrit, invited paper at The First National Symposium on Modeling and Shallow Parsing of Indian Languages (MSPIL-06) 31st March - 4th April, 2006 at IIT Bombay, appeared in the procs of the symposium
  1. Computational lexicography and Amarakosha : an online RDBMS approach, National Seminar of Language and Interface, Deptt of Linguistics, Delhi University, March 25, 2006
  1. Language Technology in India: A survey: November 2005 Issue of C.S.I. magazine
  1. A Paninian Sandhi analyzer for Sanskrit, co-author with Sachin(presented by Sachin at Platinum Jubilee International Conference, L.S.I. at Hyderabad University, Hyderabad 6-8 dec,2005)
  1. Morphological analysis of nominal inflections in Sanskrit, co-author with Subash(presented by Subash at Platinum Jubilee International Conference, L.S.I. at Hyderabad University, Hyderabad 6-8 dec,2005)
  1. Karaka analysis of complicated Sanskrit sentences, co-author with Sudhir K Mishra(presented by Sudhir K Mishra at Platinum Jubilee International Conference, L.S.I. at Hyderabad University, Hyderabad 6-8 dec,2005)
  1. Developing a Sanskrit Analysis System for Machine Translation, presented at the National Seminar on Translation Today: state and issues, Deptt. of Linguistics, University of Kerala, Trivandrum, August 23-25, 2005
  1. Socio-psychological motivations for Maithili-Hindi code switching, procs of the AsiaLex conference at National University of Singapore, Singapore, June1-3, 2005
  1. Information technology applications for Sanskrit lexicography: case of Amarakosha, procs of the AsiaLex conference at National University of Singapore, Singapore, June1-3, 2005
  1. Devanagari on computer: problems and current status, presented at  the Workshop on Hindi support and compatibility for computers at JNU, New Delhi, March31-April1, 2005
  1. Identifying verb inflections in Sanskrit morphology, (with Sudhir Mishra) proceedings of SIMPLE’04, IIT Kharagpur, 2005
  1. IT initiatives for Sanskrit in India, accepted for the Sanskrit conference at Bangkok, Thailand, June 23-36, 2005
  1. Astadhyayi Online, (with Prof. G.V.Singh), Proceedings of International Conference on Speech and Language Technology, New Delhi, Nov,2004, Tata McGraw Hill, 2004
  1. A karaka analyzer for Sanskrit (with Sudhir Mishra), Proceedings of International Conference on Speech and Language Technology, New Delhi, Nov,2004, Tata McGraw Hill, 2004
  1. Generating nominal inflectional morphology in Sanskrit,   SIMPLE 04, IIT-Kharagpur Lecture Compendium, Shyama Printing Works, Kharagpur, WB, March, 2004
  1. Optimality re-investigation  of Hindi stress system, presented at 30th International conference on Systemic Functional Linguistics, Lucknow, 2003
  1. LexFace: a lexical interface for English-Hindi translation for share-market, South Asian Language Review , Vol. No. XI. P 52-59, New Delhi
  1. The system of Panini, Language  in India, volume4:2 February 2004
  1. Current trends in Indian languages technology Language In India, volume 3:12 December 2003
  1. Jason lives ! The innate horror !! Language in India, volume 3:10 October 2003
  1. A Prolog Analyzer/Generator for Sanskrit Subanta Padas, Language in India, volume 3:11 November 2003
  1. English-SQL interface for databases (with Jerry Morgan, head of the dept of linguistics at U of I, Urbana-Champaign) presented at an international conference on  SALA held at University of Illinois in July 1999
  1. Lexical conceptual structure and domain based machine translation system (with Kapil Kapoor, G.V.Singh) presented at a symposium on Machine Aids for translation & communication, JNU, April11-12 1996)
  1. Proposing a computational system for Nominal Inflectional Morphology in Sanskrit (Proc. of national seminar on “Reorganization of Sanskrit Shastras with a view to prepare their computational database”, January, 1995)
  1. Indian theory of knowledge: an AI perspective (proc. of national seminar on “Inference Mechanisms in Shastras and Computer Science”, Academy of Sanskrit Research, Malcote, Mysore, April, 1994)

Books

  1. Perspectives on the Origin of Indian Civilizations (ed jointly with Angela Marcantonio), DK Printworld, Delhi, 2012
  2. Modern Perspectives on Vedanta (ed jointly with Bal Ram Singh, R P Singh, Diwakar Mishra), DK Printworld, Delhi, 2012
  3. Computational Semantics for Sanskrit: The Case of Amarakosha Homonyms(co-author with Mukesh Kumar Mishra), Lambert Academic Publisher, Germany, 2012
  4. Science and Technology in Ancient Indian Texts (ed. jointly with Bal Ram Singh, Umesh Kumar Singh and Diwakar Mishra), DK Printworld, Delhi, 2011
  5. Paninian Morphophonemics: A Computational Approach to Analysis and Interpretation (co-author with Diwakar Mishra), Lambert Academic Publisher, Germany, 2011
  6. Primary Derivation in Sanskrit: A Computational Approach to Analysis of Kridanta (co-author with Surjit Kumar Singh), Lambert Academic Publisher, Germany, 2011
  7. "Computer Processing of Sanskrit Nominal Inflections - Methods and Implementaiton", (with Subhash Chandra), Cambridge Scholar Publishing, Feb 2011, UK (ISBN13: 978-1-4438-2684-6 Isbn: 1-4438-2684-7)
  8. Sanskrit Computational Linguistics (ed.), proceedings of the 4th International Sanskrit Computational Linguistics Symposium, Springer Verlag, Germany, 2010 (ISBN 978-3-642-17527-5)
  9. Sanganak-janit vyavaharik Sanskrit dhatu-rupavali,  Vidyanidhi Prakashan, Delhi, 2007

Forthcoming Books

  1. Rasaayana-saara: English Translation of the chapter of dhatu shodhana maarana prakarana, with Umesh Kumar Singh and Diwakar Mishra, Vidyanidhi Prakashana, Delhi, 2013
  2. Rasasanketa Kalika: Hindi Translation, with Umesh Kumar Singh and Diwakar Mishra, Vidyanidhi Prakashana, Delhi, 2013

Contributions to Books

  1. Maithili bhasha (with Dr Sangeeta Kumari),  chapter in the book titled Bharatiya Bhasha Parichay,  Kendriya Hindi Nideshalay, Delhi, 2010
  2. Hindi-English Dictionary of Common Words (Vyavaharik Angrezi-Hindi kosh),    (edit. board),   Central Hindi Directorate. Min. of Human Resource Development, 2006
  3. Early Monsoon, appeared in “The Splendor of Sunrise” (Jessica Rapisarda ed.), 2001, International library of poetry, Watermark press, One Poetry Plaza, Owings Mills, MD 21117, USA
  4. Bhashavijnana paribhasha-kosha,   contributed computational linguistics definitions in this volume published by C.H.D., Edn. deptt. of Min. of HRD, 1999

Reviewers for Journals

Language Resource Evaluation (LRE)

Transactions on Asian Language Information Processing (TALIP)

Membership of Committees

  1. Nominated member of Under Graduate Board of Studies (UG-BOS) in the BPS Womens' University, Sonepat, Haryana (2014-2016)
  2. Nominated to the Board of Directors (BoD) of World Association for Vedic Studies (WAVES), USA, 2012 onwards
  3. Nominated a member of the Expert Committee on Linguistic Resources and Language Technology Standards by Ministry of Communications & Information Technology, Department of Information Technology, Technology development for Indian Languages (TDIL) in 2012
  4. Nominated member of the Project Review Steering Group (PRSG) of the project Establishment of Indian Language Technology Proliferation & Deployment Centre by Ministry of Communications & Information Technology, Department of Information Technology, Technology development for Indian Languages (TDIL) in 2012
  5. Nominated as Institutional member of the Steering Committee of FLaReNet, European Commission project, Pisa, Italy, Oct 2010 onwards
  6. Nominated as a member of Board of Studies of Mahatma Gandhi International Hindi University, Wardha
  7. Nominated as permanent member in the Expert Committee on CLDR, by the Ministry of Communications and Information Technology (MCIT), Govt. of India
  8. Nominated to the Indian Language Technologies and Products Sectional Committee (LITD 20) of Bureau of Indian Standards
  9. Nominated member of the PRSG for Speech Technology projects by the Technology Development  for Indian Languages (TDIL) Programme of the Ministry of Communications and Information Technology (MCIT), Govt. of India
  10. Served as member of the Working Group on Technology Development  for Indian Languages (TDIL) Programme of the Ministry of Communications and Information Technology (MCIT), Govt. of India, 2007-2009
  11. Nominated member of the Working Group on Localization of Applications & Language Technology Standards, as identified under the National  e-Governance Program (NeGP) of Govt of India
  12. Nominated member of Expert group on Computational Linguistics by Manufacturer’s Association on Information Technology (MAIT) and Ministry of Communications and Information Technology (MCIT), Govt. of India
  13. Nominated member of Managing Committee of Chinmaya Vidyalaya, Vasant Vihar, New Delhi

Organized Seminar/Workshops

  1. Organized a workshop on Sharada script in collaboration with Millennium India Education Foundation (MIEF) at the Special Center for Sanskrit Studies, Nov 8-10, 2013
  2. Organized a workshop on Sharada script in collaboration with Millennium India Education Foundation (MIEF) at the Special Center for Sanskrit Studies, Feb 18-20, 2014
  3. Organized a workshop jointly with Microsoft, USA on "Indian Language Technology and English-Urdu Machine Translation" at Special Center for Sanskrit Studies, JNU, 26-27 July, 2012
  4. Workshop on Indian Language Data: Resource and Evaluation (WILDRE), Organized in collaboration with Microsoft Research India under Language Resource Evaluation Conference (LREC) organized by European Language Resource Evaluation (ELRA), Istanbul, May21, 2012
  5. Digitization of India's ancient scripts - Brahmi, Sharada and Grantha at Special Center for Sanskrit Studies, JNU on 20th March, 2012
  6. 7th ILCI workshop on POS annotation, in collaboration with IIITM Kerala, organized at Trivandrum 9-10 March, 2012
  7. 6th ILCI workshop on POS annotation, in collaboration with IIT Bombay organized at IIT Bombay 3-4 Jan, 2012
  8. 20th International Congress of Vedanta at Special Center for Sanskrit Studies, JNU, 28-31 December, 2011
  9. 5th ILCI workshop on POS annotation, in collaboration with Utkal University, Bhubaneswar and ISI Kolkata at Utkal University, Bhubaneswar, 21-23 February, 2011
  10. 4th International Sanskrit Computational Linguistics Symposium at Special Center for Sanskrit Studies, JNU, 10-12 December, 2010
  11. E-learning and Multimedia for Education, at Special Center for Sanskrit Studies, JNU, November 3, 2010 (in collaboration with Communication & Information Services, JNU)
  12. 4th ILCI workshop on POS annotation, in collaboration with Gujrat University, Ahmedabad, 28-30 September, 2010
  13. 3rd ILCI workshop on multilingual parallel corpora building, in collaboration with IIITM-K Trivandrum at Trivandrum, April 9, 2010 
  14. International Seminar on Science & Technology in Ancient Indian Texts (SATIAIT), January 9-10, 2010 in collaboration with Center of Indic Studies, University of Massachusetts, Dartmouth, USA
  15. 2nd ILCI workshop on multilingual parallel corpora building, in collaboration with Dravidian University at Kuppam, AP, 28-29 August, 2009 
  16. 1st ILCI workshop on multilingual parallel corpora building, in collaboration with Goa University at Goa, 25-26 March, 2009 
  17. Meeting On Science & Technology In Ancient Sanskrit Texts in collaboration with Prof. Bal Ram Singh, Director, Indic Studies, Univ Massachusetts, Dartmouth, at Dev Sanskriti Vishva Vidyalaya, Haridvar, 18 June 2009
  18. Workshop on Sanskrit E-Learning And Multimedia for school teachers and students of south Delhi public schools, JNU, May 5,2009
  19. Panchatantra Painting/Sketching Event And Workshop For Children, 14 November, 2008
  20. workshop on Hindi Support And Compatibility For Computers with Hindi Unit of  J.N.U., New Delhi, March 31-April 1, 2005

Membership of Program Committee/Scientific Committee

  • 14th International Conference on Natural Language Processing (ICON-2014) to be held at Goa, India, Dec 18-21, 2014
  • 2nd Workshop on Indian Language Data: Resources & Evaluation (WILDRE) under LREC, 2014, Iceland
  • 2nd Workshop on Indian Language Data: Resources & Evaluation (WILDRE) under LREC, 2014, Iceland
  • Language Resource & Evaluation Conference (LREC), 2014, Iceland
  • Workshop on "Linguistics Olympiad and Popularizing Linguistics in Schools", ICON 2013, India
  • 6th Language Technology Conference (LTC), 2013, Poland
  • Forum for Information Retrieval and Evaluation (FIRE), 2013, India
  • 21st International Congress of Vedanta, 2013, USA
  • 1st Workshop on Indian Language Data: Resources & Evaluation (WILDRE) under LREC, 2012, Turkey
  • Language Resource & Evaluation Conference (LREC), 2012, Turkey
  • 5th International Sanskrit Computational Linguistics Symposium (ISCLS), 2012, India
  • 5th Language Technology Conference (LTC), 2011, Poland
  • 20th International Congress of Vedanta, 2011, India
  • 5th Global Wordnet Association (GWA), 2010, India
  • 4th International Sanskrit Computational Linguistics Symposium (ISCLS), 2010, India
  • 3rd International Sanskrit Computational Linguistics Symposium (ISCLS), 2009, India
  • 2nd International Sanskrit Computational Linguistics Symposium (ISCLS) , 2008, USA
  • 1st International Sanskrit Computational Linguistics Symposium (ISCLS), 2007, France

Chairs for International Conferences

  • Track chair for the "Natural Language Processing" in the 18th International Conference on Knowledge-Based and Intelligent Information & Engineering Systems (KES-2014) , Poland
  • Chair for 2nd International Symposium on Natural Language Processing (NLP'13), India

Sessions chaired/Panel discussion

  1. Chaired a talk titled "Multilingual projection" (delivered by Prof Pushpak Bhattacharya, IIT, Bombay) at a national seminar on "Towards
    standardizing Khasi for Computational Purposes", St Anthony's College, Shillong, Oct 28, 2014
  2. A panelist in the panel discussion in the workshop "Workshop on Linguistics Olympiad and Popularizing Linguistics in Schools (LOPLingS)", ICON, CDAC Noida, Dec 18, 2013
  3. Chaired a session titled TAN 1: Text Analysis 1 in the "6th Language & Technology Conference: Human Language Technologies as a Challenge for Computer Science and Linguistics", Dec 7, 2013, Poznan, Poland
  4. A panelist in the Panel Discussion on Language diversity and big data coordinated by Hans Uszkoreit in the "6th Language & Technology Conference:Human Language Technologies as a Challenge for Computer Science and Linguistics", Dec 7, 2013, Poznan, Poland
  5. A panelist in the Panel Discussion on Challenges for Computational Linguistics Coordinated by Dafydd Gibbon in the "6th Language & Technology Conference:Human Language Technologies as a Challenge for Computer Science and Linguistics", Dec 9, 2013, Poznan, Poland
  6. A panelist in the Panel Discussion on "Modern Perspective on the Origin of Indian Civilization", in the 21st International Congress of Vedanta, University of Massacusetts, Dartmouth, July 11-14, 2013
  7. Chaired in the session titled "SCIENCE & TECHNOLOGY IN ANCIENT INDIAN TEXTS" in the 21st International Congress of Vedanta, University of Massacusetts, Dartmouth, July 11-14, 2013
  8. Chaired a keynote talk titled "The Vedic Cure for the Loss of the Sense of Mystery in Western Science" byJerry Solfvin in the 21st International Congress of Vedanta, University of Massachusetts, Dartmouth, July 11-14, 2013
  9. Chaired the session titled Vedic culture and epics, in the "International WAVES Conference 2012", University of Massachusetts, Dartmouth, MA, USA, July 14, 2012.
  10. Chaired the session titled Sanskrit Vyakarana and Linguistics, in the "International WAVES Conference 2012", University of Massachusetts, Dartmouth, MA, USA, July 14, 2012.
  11. Chaired a session in the seminar "Convergence of Science and Spirituality, Nov5, organized by ISSKCON, Sri Venkateswara College, South Campus Delhi University)
  12. Chaired a session in the "Symposium on Origin of Indian Civilization", Center of Indic Studies, University of Massachusetts, Dartmouth, 8-9 July, 2011
  13. Chaired a session in the "Vedanta and Science Symposium", Center of Indic Studies, University of Massachusetts, Dartmouth, 6-7 July, 2011
  14. Chaired a session on "Civilizational Harmony" in the 8th International conference of the WAVES, at Trinidad & Tobago, West Indies, August 5, 2010
  15. Chaired a plenary session in the "19th International Vedanta Congress", organized by Center of Indic Studies, University of Massachusetts, Dartmouth, July 29, 2010
  16. Chaired a panel discussion on "Sanskrit its evolution and representation in the digital age", in the "19th International Vedanta Congress", organized by Center of Indic Studies, University of Massachusetts, Dartmouth, July 30, 2010
  17. Chaired a session in “International Workshop on Referential Entity Resolution”, organized by AU-KBC, MIT campus, Anna University, Chennai, Jan15-17,2008
  18. Chaired a session on “Computational Linguistics” in the 28th All India Conference of Linguists, Department of Linguistics, BHU, on Nov 4, 2006
  19. Chaired a session in the National Seminar on Translation Today: state and issues, Deptt. of Linguistics, University of Kerala, Trivandrum, August 23-25, 2005

Workshops

  1. 8th ILCI workshop on POS annotation titled "National Workshop on BIS Annotation Scheme for Indian Languages"  in collaboration with IIIT Hyderabad, at Hyderabad 21-23 July, 2012
  2. Participated in 4 day workshop on Samasa Tagging at Poornaprajna Vidyapeetha, Bangalore, May 16-19, 2009
  3. Participated in two-day workshop on POS tagging for Sanskrit under the Sanskrit Consortia, organized by Dept. of Sanskrit Studies, University of Hyderabad, Hyderabad, June20-21, 2008
  4. Participated in a series of workshops on editing Hindi-English-Bodo Dictionary, at Central Hindi Directorate during first quarter of 2008
  5. International Workshop on Referential Entity Resolution, AU-KBC, MIT campus, Anna University, Chennai, Jan15-17,2008
  6. Review of speech technologies projects, organized by TDIL, MCIT, CDAC- Mumbai, Dec 12, 2007
  7. Indic languages tagset design, a one day workshop organized at Microsoft Research India Lab, Bangalore, Dec 2, 2007
  8. Indic languages tagset design,  a four day workshop organized at Microsoft Research India Lab, Bangalore, March15-18, 2007
  9. Indic languages tagset design,  a three day workshop organized at Microsoft Research India Lab, Bangalore, Jan19-21, 2007
  10. Indic languages tagset design,  a two day workshop organized at Microsoft Research India Lab, Bangalore, 2-3 Dec, 2006
  11. Indic languages tagset design,  a two day workshop organized at Microsoft Research India Lab, Bangalore, 28-29 Oct, 2006
  12. Government @27/7: overcoming contextual & cultural challenges , South Asia e-government summit organized at The Grand, Vasant Kunj, Oct 26, 2006 by DIT, GOI, CDAC, ASIA, OSPA forum

Supervision of Research Students

Projects

  1. Shallow Parser Tools (SP Tools) for Indian Languages - Maithili,  as Director, funded under the Technology Development for Indian Languages (TDIL) program by the Ministry of Communications and Information Technology (MCIT), March , 2012 onwards
  2. Indian Languages Corpora Initiative (ILCI), as Director and Consortium Leader, funded under the Technology Development for Indian Languages (TDIL) program by the Ministry of Communications and Information Technology (MCIT), March, 2009 onwards
  3. Developing Computational Tools For Sanskrit And Sanskrit-Hindi Machine Translation, as Director, funded under the Technology Development for Indian Languages (TDIL) program by the Ministry of Communications and Information Technology (MCIT), March 28, 2008 onwards
  4. Online Multilingual Amarakosa (2005-2007):  as Director: funded by JNU under UPOE program of UGC
  5. Database system for dialects of Hindi (2002-2003): as co-director: Under my supervision , a team of software engineers developed a VB-SQL server software for dialect mapping of Hindi, sponsored and run by MGAHU, New Delhi
  6. ALED (Active Living Everyday)  at Human Kinetics, USA: a dynamic content, interactive website for fitness management 2002: project manager and lead programmer
  7. Internationalization project of CollaTech e-solutions using JAVA resource bundles. Project involves making the source code compatible to the multilingual locales and languages, 2001
  8. Database development project at the University of Illinois’s Beckman Institute for Advanced Science and Technology 1998-1999: database programmer
  9. Analyzing schizophrenic speech at Dept. of Psychology, University Of Illinois, Urbana-Champaign, 1997: research assistant
  10. Developing technical terminology for Hindi (attended a series of workshops organized by Scientific and Technical Terminology Commission, Min of HRD, 1994-1996) and contributed in developing technical terminology for Hindi
  11. Computer Assisted Sanskrit Teaching Learning Environment (DoE sponsored project at JNU 1991-1994): research investigator

Technical Expertise

  • Machine Translation, computational lexicography computational grammar development, grammar analysis, interactive linguistic instruction, software internalization/localization
  • Natural Language Interface, Attribute Value Grammar (AVG) development, testing, parsing, algorithm implementation
  • Language Technology project management
  • B2B e-commerce solutions,  test drivers in Java Servlet/JSP, AWT/Swing, Apache/Tomcat, JDBC/SQL Server
  • RAD database and web applications, design and development of user interface in the windows/NT environment using Delphi, SQL Server, Interbase
  • project management in software development

Awards / Honors

  • DAAD fellowship (Germany) to teach Computational Linguistics at University of Wurzburg, Germany in 2014
  • Nominated to the Editorial Board of Springer for their journal Language Resource and Evaluation (LRE) since Feb 2010.
  • Mukesh and Priti Chatter Distinguished Professor of History of Science at University of Massachusetts Dartmouth, USA (2009-2012).
  • Evaluated as Exceptional Programmer by the HKUSA software director, 2002
  • Beckman Institute for Advance Science & Technology, University of Illinois Research Assistantship for advanced studies (Illinois, USA) 1998
  • Member of the prestigious Cognitive Science group at the University of  Illinois (USA): 1997-1999
  • Ranked Excellent Teacher twice by students of Hindi at University of Illinois (USA)  :1997,1998
  • University of Illinois  Urbana-Champaign Teaching Assistantship for teaching Hindi language and linguistics to U.S. students: (Illinois, USA) :1997
  • Dept. of Psychology, University of Illinois, Urbana-Champaign Research Assistantship (summer 1997) to study schizophrenic speech (Illinois, USA) 1997
go up