|
Education
M.A: JNU, New Delhi, 1991 in Linguistics
M.Phil: JNU, New
Delhi, 1993 in
Linguistics (Computational
Morphology)
Ph.D: JNU,
New Delhi: 1996 in Linguistics (Machine Translation)
M.A: University of Illinois,
Urbana-Champaign, USA: 1999 in
Linguistics (Natural Language Interface for databases)
Research Interest
Indian languages corpora and standards, Sanskrit and Hindi linguistics, Science & Technology in Sanskrit texts, Computational Lexicography, Machine Translation, e-learning, web based technologies, RDBMS techniques, software design and localization
Employment History
-
Mukesh and Priti Chatter Distinguished Professor of History of Science, University of Massachusetts Dartmouth, USA (2009- )
- Research and experimentation Scinece And Technology In Ancient Indian Texts (SATIAIT)
- taught a course "Sanskrit-II" at University of Massachusetts Dartmouth, July 5 - Agu 4, 2011
- taught a course "Sanskrit-I" at University of Massachusetts Dartmouth, May 31 - June 30, 2011
- taught a summer course titled Introduction to India’s culture and traditions, summer-II course, University of Massachusetts, Dartmouth, July 7-Aug 10, 2010
-
Associate Professor, Computational Linguistics,Special Center for Sanskrit
Studies, J.N.U (2008-till date)
- teaching computational linguistics/NLP and
computer application courses to the MA, M.Phil./Ph.D. students of Sanskrit
and Linguistics
- supervising research students, managing projects, R&D consultancies
-
Assistant Professor, Computational Linguistics, Special Center for Sanskrit
Studies, J.N.U (2002-2008)
-
Software Development Specialist, Human Kinetics, Champaign,
Illinois, USA (2001 – 2002)
- Worked as project manager and lead software engineer for building
educational and e-learning software applications using the tools ColdFusion,
SQL Server
- Advised HK-USA on software development tools and techniques
-
Software Engineer, Collaborative Technologies (renamed dChain
Commerce), Champaign, Illinois, USA (1999 – 2001)
- Worked as a lead programmer in developing web/windows based applications
using Java servlet / JSP, applets, AWT, Swing, SQL server/JDBC.
- Completed a project on internationalization of web-application using JAVA
resource bundles. Project involved making the source code compatible to the
marketplace locales and languages
-
Research Assistant, Beckman Institute for Advanced Science and
Technology, University of Illinois, Urbana-Champaign, USA (1998 - 1999)
- Worked with the lead database group of Beckman Institute System Services,
Beckman Institute for Advanced Science and Technology for conversion of
Paradox based systems to Delphi/Oracle.
-
Research
Assistant, Dept. of Psychology, University of Illinois,
Urbana-Champaign, USA (summer, 1997)
- Completed a syntactic analysis of recorded speech samples of schizophrenic
patients
-
Teaching Assistant, Dept. of Linguistics, University of Illinois,
Urbana-Champaign, USA (1996-1998)
- Taught elementary and intermediate Hindi linguistics courses to undergrad
students
- Prepared new courses, evaluated and graded student
Consultancy (7)
-
-
Linguistic Data Consortium (LDC), University of Pennsylvania, Philadelphia, USA for narrow band data for language identification by computers, January - April 2011
-
Center of Indic Studies, University of Massachusetts, Dartmouth, USA, for exploring scientific texts in Sanskrit, October 2009 -
-
Centre for Development of Advanced Computing (CDAC) Pune for their Sanskrit localization project, August-October 2007
-
Microsoft Research India Lab, Bangalore for their POS tagging project for Indic languages, November 2006-January 2007
-
Microsoft India on their project on Handwriting Recognition for Devanagari, February – June 2006
-
Mahatma Gandhi International Hindi University, Wardha (then New Delhi)on their Database system for dialects of Hindi project (2002-2003)
Invited Lectures/speeches
(81)
-
Resource creation for Indian languages - an update, invited talk at a workshop on "Reinforcing International Collaboration on Language Resources and Evaluation", joint COCOSDA-FLaReNet workshop, LREC, Istanbul, May 26, 2012
-
Emerging trends in Language Technology Resources (LTR) in India, delivered at Microsoft Research, Redmond, USA, Apr 9, 2012
-
Science and Technology initiatives for Indian Languages:
Impacting India through US and indigenous expertise, delivered at an International seminar on "Presence of America in India", University of Illinois, Urbana-Champaign, IL, USA, Apr 7, 2012
-
"Sanskrit e-learning", invited paper in a national seminar on education at Shri Lal Bahadur Rashtriya Sanskrit Vidyapeeth, Delhi, Feb17, 2012
-
Sanskrit shodh ki naii dishaaen, talk delivered at Shri Lal Bahadur Rashtriya Sanskrit Vidyapeeth, Delhi, Feb10, 2012
-
Newer areas of R & D in Sanskrit at JNU: a report, DPS Society, Dwarka, Jan18, 2012
-
Computational Linguistics Corpora and standards: the big picture, distinguished lecture at JNU lecture series, Feb 3, 2012
-
Computational Linguistics Corpora and standards: the Indian situation, talk delivered at IIT Patna, Jan30, 2012
-
Enabling Indian languages on computers, Workshop on Indian language computing, Mohini Devi Memorial School, Araria, 13-16 Jan, 2012
-
"Indian languages corpora Initiative: current status and future diections", inaugural talk at ILCI workshop organized by IIT Mumbai, Jan3-4, 2012
-
"Promoting Sanskrit through Science and Technology: A report on the latest developments at JNU", 20th Internatioal Vedanta Congress, JNU, Dec28-31, 2011
-
Introduction to programming in Java, , NLP Lecture Series at Kashmir University, sponsored by LDC-IL, CIIL Mysore, Nov8-9, 2011
-
"Linguistic Theory and Grammar Formalisms", NLP Lecture Series at Kashmir University, sponsored by LDC-IL, CIIL Mysore, Nov8-9, 2011
-
"Science & Technology in Sanskrit at JNU: A report", Convergence of Science and Spirituality, Nov 5, 2011, organized by ISSKCON, Sri Venkateswara College, South Campus Delhi University)
-
"Issues in POS tag-set design for Indian languages", Sept19, 2011, Inaugural talk in the "Workshop on POS-Morph: Urdu-Hindi& Maithili", organized by LDC-IL, CIIL Mysore, 19-21 Sept, 2011, CIIL Mysore
-
Computational Linguistics and Sanskrit, lecture delivered at International Sanskrit conference at Uttaranchal Sanskrit University, Haridwar, 11th Sept, 2011
-
"Purushartha and Yoga: a no-contradiction approach", invited talk in the "Summer Workshop on Dharmic Traditions", Center for Indic Studies, University of Massachusetts, Dartmouth, July19 & July26, 2011
-
"Modern Hindu Practices and their evolution", invited talk in the "Summer Workshop on Dharmic Traditions", Center for Indic Studies, University of Massachusetts, Dartmouth, July19, 2011
-
"What is dharma: following a textual strategy", invited talk in the "Summer Workshop on Dharmic Traditions", Center for Indic Studies, University of Massachusetts, Dartmouth, July18, 2011
-
"Why Study Sanskrit", talk delivered in "Trindic Corps Youth Camp", Organized by Center of Indic Studies and 3R Foundation, University of Massachusetts, Dartmouth, July1-4, 2011
-
Computational Linguistics, Talk delivered at Hindu College, Dec 21, 2010
-
Sanskrit and Computer, lecture delivered at NCERT teachers orientation, Dec 1, 2010
-
Language Technology Resources for Sanskrit and other Indian Languages at Jawaharlal Nehru University, India, LDC Institute, University of Pennsylvania, July 22, 2010
-
Multilingual Multimodal resources – creating a leveling field in India, COCOSDA/WRITE/FLaReNet workshop “language Technology Issues for International cooperation”, LREC2010, Malta, 22 May, 2010
-
Indian Languages Corpora and Application Development at JNU, Prof R N Shrivastava Memorial Lecture, Kendriya Hindi Sansthan, Agra, March 23, 2010
-
Language Processing and Corpus Linguistics, Prof R N Shrivastava Memorial Lecture, Kendriya Hindi Sansthan, Agra, March 22, 2010
-
Language Technology & Sanskrit, Dept of Sanskrit, Hansraj College, Delhi University, Feb 18, 2010
-
Computational Linguistics for Sanskrit, PhD research workshop, Sri Lal Bahadur Shastri Rashtriya Sanskrit Vidyapeeth, New Delhi, Feb 16, 2010
-
A national effort to develop technologies for multiple national languages the experience of India, FLaReNet (Fostering Language Resource Network), funded by European Commission, Barcelona, Spain, Feb 12, 2010
-
International Collaboration: Europe and India, FLaReNet (Fostering Language Resource Network), funded by European Commission, Barcelona, Spain, Feb 12, 2010
-
Why should you consider Sanskrit, Sanskrit Dept, Mothers School, Jan 22, 2010
-
Hamaare gaon, shiksha aur computer, Kuruom Vidyalaya: Inauguration and Educational Workshop, Jan 14, 2010, Kuruom, Sultanpur, UP
-
Web based technologies, Web Creations, Lucknow, Jan 13, 2010
-
Linguistics for Sanskrit, “Workshop on Sanskrit and Computers: Getting equipped to face the new challenges" organized jointly by the Department of Sanskrit Studies, University of Hyderabad, and the Sanskrit Academy, Osmania University, Hyderabad, 20 Dec 2009
-
Java for Computational Linguistics, “Workshop on Sanskrit and Computers: Getting equipped to face the new challenges" organized jointly by the Department of Sanskrit Studies, University of Hyderabad, and the Sanskrit Academy, Osmania University, Hyderabad, 21-23 Dec 2009
-
Computational linguistics R&D at JNU, “Workshop on Sanskrit and Computers: Getting equipped to face the new challenges" organized jointly by the Department of Sanskrit Studies, University of Hyderabad, and the Sanskrit Academy, Osmania University, Hyderabad, 23 Dec 2009
-
Computational Linguistics and TDIL Programme, panel discussion on “Getting less-resourced languages on board - which languages, which resources, which methods, which funding?", 4th International Language and Technology Conference (4th LTC), Poznan, Poland, Nov 6, 2009
-
Indian Language Corpora Initiative (ILCI), invited talk, 4th International Language and Technology Conference (4th LTC), Poznan, Poland, Poznan, Poland, Nov 6, 2009
-
Sanskrit and Computer, Sanskrit Dept, Gargi College, Delhi University, Sept 17,2009
-
Documentation initiatives for Sanskrit and other Indian languages, International seminar on language documentation, Center for Linguistics, School of Languages Literatures & Culture Studies, JNU, Oct 27, 2009
-
JNU Text Encoding & Search for Sanskrit (J-TESS), Rashtriya Sanskrit Sansthan, Delhi, Oct 26, 2009
-
Issues in Sanskrit NLP and e-learning, Kanya Mahavidyalaya, Jallandhar, Aug 7,2009
-
Sanskrit aur Kamputar, DPS HRD Center, Dwarka, New Delhi, Apr 15, 2009
-
Emerging trends in Language Technology and Sanskrit Computing at the Science Festival 2009, organized by School of Life Sciences, J.N.U., Feb 27, 2009
-
Methods and Issues in Computer Processing of Panini, Bhandarkar Oriental Research Institute, Pune, Feb22, 2009
-
Introduction to Language Technology, Center of Indian Languages, SLL&CS, JNU, Jan 23, 2009
-
Introduction to Natural Language Processing, refresher course for Computer Science, Academic Staff College, JNU, Nov 14, 2008
-
Demonstration of Sanskrit software developed at JNU (included as part of Sanskrit CD released by Ministry of IT and CDAC Pune, Nov 25), Tirupati Sanskrit Vidyapeeth
-
Multimedia e-learning for Sanskrit, at DPS, HRD Center, Dwarka, New Delhi, Nov 16, 2008
-
Issues in Sanskrit NLP and e-learning, Dept of Sanskrit, Pali and Prakrit, MD University Rohtak, Nov 7, 2008
-
Sanskrit and Computer, Haryana Sanskrit Academy at Bhiwani, 28/9/2008
-
Plenary speech in the international workshop titled Anaphora Resolution II, organized by Bergen University, Bergen, Norway, Aug29-31, 2008
-
Lecture series on NLP and Panini, Deptt of Vyakarana, Faculty of Arts, Banaras Hindu University (BHU), March 25-26, 2008
-
Teaching through e-learning, Central Institute of Higher Tibetan Studies, Sarnath, Dec29-30,2007
-
Multilingualism, localization and language computing, IP Engineering College, Gaziabad, Nov 17, 2007
-
Invited speech at First International Symposium on Sanskrit Computational Linguistics, INRIA, at Paris-Rocquencourt, 29 Oct-31 Oct 2007
-
Sanskrit Computational Linguistics at Sanskrit Shodh Parishad , Delhi University, August 1, 2007
-
Lecture series on Language Technology at the NLSI summer training, CIIL Mysore, June 18-19, 2007
-
Computational Linguistics, Dept of Linguistics, Kashmir University, Srinagar, Feb23, 2007
-
Lecture-series in the area of Sanskrit Computational Linguistics at the Dept of NLP and Shabdabodha, Rashtriya Sanskrit Vidyapeetham, Tirupati, Andhra Pradesh, Jan25-27, 2007
-
Computational Phonology and Morphology, PGDLT course at CDAC, Noida, Jan 5, 2007
-
Language Technology. PGDLT course at CDAC, Noida, Dec 21, 2006
-
Lecture series on Language Technology , MA course on Language Technology at Mahatma Gandhi International Hindi University, Wardha, 11-15 Dec, 2006
-
Sanskrit and computer, Haryana Sanskrit Academy at Panchkula, Chandigarh, Nov 26, 2006
-
Hindi Lexicography and computer, at Central Hindi Institute, Agra, at the workshop on “Hindi dialect dictionary” , Oct 16,2006
-
Sanskrit and Computer, Golden Jubilee Celebration of PGDAV College, Delhi University (organized by Delhi Sanskrit Academy, Sept 29, 2006
-
Invited talk at Summit on Internationalization of Web at Hotel Grand Ashoka, Bangalore (24-25 August, 2006) organized by World Wide Web Consortium (W3C) and Manufacturers’ Association of Information Technology (MAIT)
-
Computational Linguistics: an overview, CDAC Kolkata, July 6, 2006
-
Linguistic theories and formal grammar models, CDAC Kolkata, July 5, 2006
-
Paninian Formalism for NLP, CDAC Kolkata, July 4, 2006
-
Invited talk at the First National Symposium on Modeling and Shallow Parsing of Indian Languages(MSPIL-06) 31st March - 4th April, 2006 at IIT Bombay
-
Language and computers, ASC Orientation Course, Rajdhani College, Delhi University, Dec 27,2005
-
Invited speech at the National Seminar on Translation Today: state and issues, Deptt. of Linguistics, University of Kerala, Trivandrum, August 23-25, 2005
-
Machine Translation in the MA course on Translation Technology at Mahatma Gandhi Antarrashtriya Hindi University, Wardha, April11-16, 2005
-
Computational aspects of Indian grammatical tradition at the 4th course on IGT organized by CIIL,Mysore at NRLC, Patiala, Feb2005
-
Computational aspects of Panini at the Refresher course on Sanskrit Linguistics, organized by CIIL,Mysore at Deptt of Linguistics , Punjabi University, Patiala and at the NRLC, Patiala, Feb-March 2003
-
Computational linguistics, e-learning at the Orientation course on Information technology at Academic Staff College, JNU, August 2004
-
Computational linguistics, e-learning at the Orientation course on Information technology at Academic Staff College, JNU, December 2004
-
Macromedia ColdFusion for dynamic web content at UNESCO sponsored refresher course at School of Information Technology, JNU, September, 2003
-
Lectures on general linguistics, Kendriya Hindi Sansthan, Delhi: (1995-1996)
-
Lectures in UGC sponsored computer courses, Shri Lal Bahadur Shastri Rashtriya Sanskrit Vidyapeeth: (1995-1996)
Keynote talks
Delivered a keynote talk titled Current progress in developing Sanskrit and other linguistic resources, in International WAVES Conference 2012, University of Massachusetts, Dartmouth, MA, USA, July14, 2012
"Issues in POS annotation and limited crowd sourcing - the case of ILCIANN", keynote talk in a national seminar on "POS Annotation for Indian languages: Isues & Perspectives, organized by LDC-IL, CIIL Mysore, Dec 12-13, 2011
Computational Linguistics and Sanskrit, keynote talk delivered at a workshop on "Computational Sanskrit" at SD College Ambala, 11th Nov, 2011
Research
papers/articles(90)
-
Arya vs Dravida and linguistc foundations of Indian languages, International WAVES Conference 2012, University of Massachusetts, Dartmouth, MA, USA, July14, 2012.
-
Developing Sanskrit Corpora Based on the National Standard: Issues and Challenges, with Madhav Gopal, proceedigs of "Workshop on Indian Language Data:Resource and Evaluation", Language Resource Evaluation Conference (LREC), Istanbul, e-publication by European Language Resource Association (ELRA), May, 2012
-
Challenges in Developing Named Entity Recognition System for Sanskrit, with Sachin Kumar and Sobha Lalitha Devi, proceedigs of "Workshop on Indian Language Data:Resource and Evaluation", Language Resource Evaluation Conference (LREC), Istanbul, e-publication by European Language Resource Association (ELRA), May, 2012
-
Grapheme-to-Phoneme converter for Sanskrit Speech Synthesis, with Diwakar Mishra and Kalika Bali, proceedigs of "Workshop on Indian Language Data:Resource and Evaluation", Language Resource Evaluation Conference (LREC), Istanbul, e-publication by European Language Resource Association (ELRA), May, 2012
-
Text encoding and search for Āyurvedic texts: an interconnected lexical database, proceedings of the workshop on “Adaptation of language resources and tools for processing cultural heritage objects”, Language Resource Evaluation Conference (LREC), Istanbul, e-publication by European Language Resource Association (ELRA), May, 2012
- Transfer grammar for Sanskrit-Hindi Machine Translation, co-author with Kumar Nripendra Pathak, World Sanskrit Conference, Delhi, Jan5-10, 2012
- Strategies for Anaphora Handling in Panchatantra, co-author with MadhavGopal, World Sanskrit Conference, Delhi, Jan5-10, 2012
- Theory of Anaphora resolution in traditional Sanskrit texts, co-author with Diwakar Mishra, World Sanskrit Conference, Delhi, Jan5-10, 2012
- Standard Text encoding scheme and search for Ayurvedic texts, co-author with Archana Tiwari, World Sanskrit Conference, Delhi, Jan5-10, 2012
- Strategies for Semantic representation of Karma Karaka (karturipsitatamam karmam), co-author with Manji Bhadra, World Sanskrit Conference, Delhi, Jan5-10, 2012
- Knowledgebase for Sanskrit-Hindi machine Translation divergences, co-author with Diwakar Mani, World Sanskrit Conference, Delhi, Jan5-10, 2012
- Resolving Anaphors in Sanskrit, co-author with Madhav Gopal, proceedings of 5th LTC, Poznan, Poland, Nov 25-27, 2011
- Creating Multilingual Parallel Corpora in Indian Languages, Co-author with Narayan Choudhary, proceedings of 5th LTC, Poznan, Poland, Nov 25-27, 2011
- Issues in annotating less recourced languages - the case of Hindi from Indian Languages Corpora Initiative (co-author with Pinkey Nainwani, Esha Banerjee, Shiv Kaushik), proceedings of 5th LTC, Poznan, Poland, Nov 25-27, 2011
- A Rule based Method for the Identification of TAM features in a PoS Tagged Corpus, Co-author with Narayan Choudhary, Pramod Pandey, proceedings of 5th LTC, Poznan, Poland, Nov 25-27, 2011
- "Origin and Development of Indian Languages and Linguistic Area", in Perspectives on the Origin of Indian Civilizations (Angela Marcantonio, Girish Nath Jha ed.), DK Printworld, Delhi, 2012
- "Paninian System of Languages: A Computational Paradigm", Invited paper in the "Vedanta & Science Symposium", Center for Indic Studies, University of Massachusetts, Dartmouth, July6, 2011
- Sanskrit verb argument valence: a computational analysis, co-author with Subhash Chandra, proceedings of the O-COCOSDA 2010, Kathmandu
- Translating politeness across cultures: case of Hindi and English, co-author with Ritesh Kumar (presented by Ritesh Kumar), procs of 3rd ACM International Conference on Inter-cultural collaboration, Copenhegen Business School, Denmark, 18-20 Aug, 2010
- SaHiT - Sanskrit-Hindi Translator, in the 8th International conference of the WAVES, at Trinidad & Tobago, West Indies, August 5, 2010
- Text Encoding and Search for Sanskrit at the "19th International Vedanta Congress", organized by Center of Indic Studies, University of Massachusetts, Dartmouth, July 30, 2010
- SATIAIT – progress report and translation of Rasaayana-saara, presented at the "Symposium on Science and Technology in Ancient Indian Text", Center of Indic Studies, University of Massachusetts, Dartmouth, USA, July 26, 2010
- The TDIL program and the Indian Language Corpora Initiative (ILCI), Proceedings of the Seventh conference on International Language Resources and Evaluation (LREC'10), published by ELRA, isbn 2-9517408-6-7, pp982-985, Valletta, Malta, May 19-21, 2010
- Multilingual Multimodal Resources of the future: challenges and opportunities for India (with Ritu Nidhi), abstract accepted for W3C international conference titled “World Wide Web: Technology, Standards and Internationalization Conference”, May 6-7, 2010, New Delhi
- Online Multilingual Amarakosha: the relational lexical database (with R. Chandrashekar, Umesh Kumar Singh, Vibhuti Nath Jha, Satyendra Pandey, Surjit Kumar Singh, Mukesh Mishra), demo paper in the proceedings of the 5th Global Wordnet Conference, IIT Mumbai, Feb1, 2010
- Annotating Sanskrit corpus – adapting IL-POSTS (with Madhav Gopal and Diwakar Mishra), published in the proceedings of the 4th International Language technology Conference, Poznan, Poland, Nov 2009
- Discourse Anaphora and Resolution Techniques in Sanskrit, with Sobha L, Diwakar Mishra begin_of_the_skype_highlighting , accepted as poster paper at the 7th Discourse Anaphora and Ambiguity Resolution Colloquium (DAARC-2009), held in Goa, India, Nov 5-6, 2009
- Linguistic pedagogy and e-learning: the case for Sanskrit, co-author with Prachi Sinha & Priti Bhowmik, accepted for “Contemporary Themes and Issues in Language Pedagogy” vol2, Vaishna Narang ed, 2009
- Automatic Indexing System for texts of Indian Intellectual Tradition, with Diwakar Mani, Umesh Kumar Singh, Prachi Sinha, presented at 18th International Vedanta Congress, University of Massachusetts, Dartmouth, July16-19,2009
- Sanskrit Analysis System (SAS), with Manji Bhadra, Surjit Kumar Singh, Sachin Kumar, Subash, Muktanand Agrawal, R.Chandrasekhar, Sudhir K Mishra, in Lecture Notes in Computer Science 5402 Springer Berlin / Heidelberg, ISBN 978-3-540-93884-2 (Amba P. Kulkarni, Gérard P. Huet eds) pp116-133, 2009
- An algorithm for morpho-phonemic processing of Sanskrit, Sachin Kumar, Diwakar Mishra , Girish Nath Jha, presented by Sachin Kumar & Diwakar Mishra begin_of_the_skype_highlighting at 3rd National Conference on Methods and Models in Computing, School of Computer and Systems Sciences, JNU, Dec 9, 2008
- Architecture of dyna-content web-applications, Girish Nath Jha, invited paper at “Requirement Workshop on Assessment Needs of Website of Animal Sciences, Dairy Science, Education and Research along with Transfer of Technology under NAIP funded AGROWEB Sub-Project (Component-I)”, National Dairy Research Institute, Karnal, Oct 3, 2008
- Sanskrit Anaphors, with Sobha L, Diwakar Mishra , Surjit K Singh, Praveen Pralayankar, Johansson, C. (Ed.) Proceedings of the Second Workshop on Anaphora Resolution (2008), ISSN 1736-6305 Vol. 2, Cambridge Scholars Publishing, 2007
- Semantic processing in Panini’s karaka system, Girish N Jha, Sudhir K Mishra, ( Lecture Notes in Computer Science series) 5402 Springer Berlin / Heidelberg, ISBN 978-3-642-00154-3 (Gérard P. Huet, Amba P. Kulkarni, Peter Scharf eds) pp 239-252, 2009
- Bhaarat men bhasha praudyogiki: ek sarvekshan, Girish N Jha, Diwakar Mani, Diwakar Mishra , Gaveshana, Kendriya Hindi Sansthana, Agra
- A Common Parts-of-Speech Tagset Framework for Indian Languages, Sankaran Baskaran, Kalika Bali, Tanmoy Bhattacharya, Pushpak Bhattacharyya, Monojit Choudhury, Girish Nath Jha, Rajendran S., Saravanan K., Sobha L., KVS Subbarao, LREC 2008 - 6th Language Resources and Evaluation Conference, May 26-June1, 2008, Marrakech, Morocco
- Issues in computer processing of languages – a case for Sanskrit, presented at the International Symposium on Brain Cognition & Behavior, jointly organized by Center for Linguistics, SLL&CS, J.N.U. and Utrecht University (UU), Netherlands, March14-15, 2008
- Designing a Common POS-Tagset Framework for Indian Languages (co-author with Sankaran Baskaran, Kalika Bali, Tanmoy Bhattacharya, Pushpak Bhattacharyya, Rajendran S, Saravanan K, Sobha L and Subbarao K V), presented (by Kalika Bali) at International Joint Conference on Natural Language Processing (VI workshop on Asian Language Resources), IIIT, Hyderabad, Jan 11, 2008
- Morphological Analyzer for Great Andamanese Verbs: Implementing a Concatenative Template, co-author with Narayan Choudhary and Anvita Abbi, Vishwabharat (April 2007 January 2008 Journal) 113-118. TDIL, New Delhi,
- Survey of e-content in virtual media: the case of Sanskrit, presented in a national seminar on teaching through e-learning, Central Institute of Higher Tibetan Studies, Sarnath, Dec29-30,2007
- software localization me bhasha sambandhi samasyaem, in Gaveshana, Kendriya Hindi Sansthan, Agra, vol88,2007
- Sanskrit Computational Linguistics, presented at a national seminar on “Relevance of Sanskrit for science and technology” organized by Gandhi Memorial National College, Ambala, Dec 8, 2007
- Inflectional Morphology Analyzer for Sanskrit, Sanskrit Computational Linguistics ( Lecture Notes in Computer Science series) 5402 Springer Berlin / Heidelberg, ISBN 978-3-642-00154-3 (Gérard P. Huet, Amba P. Kulkarni, Peter Scharf eds) pp 219-238, 2009
- Sanskrit e-learning: Grammar-Lexicon, paper presented at "National Seminar on Teaching of Sanskrit through e-learning" organized by NCERT, Aug30, 2007.
- Optimality re-investigation of Hindi stress system, Rainbow of Linguistics (Vol I, 2007, pp 72-78), T. Media Publication, 33, College Row, Kolkata-700009
- Indian Theories of Knowledge compared with Western Theories and Artificial Intelligence, appeared in Creative Forum – journal of literary and creative writings (pp-63-76), Vol. 20, No.1, Jan-July2007, Bahri Publications(2006), New Delhi
- Localization (l10n) of web-applications: strategies for Hindi/Sanskrit, presented at the Workshop on Localization of e-Governance Applications: Issues & Standards, February 21-22, 2007, Napoleon II, Hotel Le Meridien, New Delhi, organized by MAIT and MCIT
- Knowledge modeling in India: an Artificial Intelligence perspective, accepted for the Indology Journal, International Indology Conference, Goa, Feb8-10, 2007
- A Natural Language Interface for Databases, procs of National Conference in Methods and Models in Computing [NCM2C 2006], School of Computer & Systems Sciences, J.N.U., Dec 2006, Macmillan India Ltd., Delhi, pp75-91
- Proposing a System for Identifying Divergent Sentences in Russian-English Machine Translation, co-author with Ainura Asamidinova, procs of National Conference in Methods and Models in Computing [NCM2C 2006], School of Computer & Systems Sciences, J.N.U., Dec 2006, Macmillan India Ltd., Delhi, pp161-171
- Linguistic and cultural challenges for software localization (l10n): the case of Hindi/Sanskrit, presented at a national seminar on Technology development for content creation and Localization in Indian languages, organized by CSTT, MHRD, Govt. of India and RKGIT, Gaziabad at RKG Institute of Technology, Gaziabad, Dec 1, 2006
- Hierarchical vs. flat tagsets, presented at Microsoft Research India, Bangalore, 28th October, 2006
- Computer applications for Lexicography, at Central Hindi Institute, Agra, in the workshop on “Hindi dialect dictionary”, Oct 16,2006
- Problems in Hindi Computing, at Central Hindi Institute, Agra, in the workshop on “Hindi dialect dictionary”, Oct 16,2006
- Strategies for metrical analysis of Sanskrit text, co-author with Diwakar Mishra , procs of National Seminar on Perspectives in Linguistics, Kashmir University, Srinagar, 2006
- Issues in Sandhi Processing of Sanskrit co-author with Sachin Kumar, presented by Sachin at 28th All India Conference of Linguists, Department of Linguistics, BHU, 02-04 Nov 2006
- Karaka based Sentence validation System, co-author with S.K. Mishra, presented by S.K.Mishra at 28th All India Conference of Linguists, Department of Linguistics, BHU, 02-04 Nov 2006
- Sanskrit language pedagogy: An e-learning approach , co-author with Priti Bhowmik, presented by Girish N Jha at 28th All India Conference of Linguists, Department of Linguistics, BHU, 02-04 Nov 2006
- Online indexing system for Ashtadhyayi, co-author with Diwakar Mishra , presented by Diwakar Mishra at 28th All India Conference of Linguists, Department of Linguistics, BHU, 02-04 Nov 2006
- Verb phrase Analyzer of Great Andamanese , co-author with Narayan Kumar Chaudhary, Anvita Abbi, presented by Narayan at 28th All India Conference of Linguists, Department of Linguistics, BHU, 02-04 Nov 2006
- Strategies for Identifying and Processing Derived Nouns in Sanskrit , co-author with Surjit Kumar Singh, presented by Surjit at 28th All India Conference of Linguists, Department of Linguistics, BHU, 02-04 Nov 2006
- Translating Russian Impersonal Constructions into English: A Study by MT Errors, co-author with Ainura Asamidinova, P.K. Pandey, abstract published by the 28th All India Conference of Linguists, Department of Linguistics, BHU, 02-04 Nov 2006
- Sanskrit Subanta Recognizer and Analyzer for Machine Translation , co-author with Subash, presented by Subash at 28th All India Conference of Linguists, Department of Linguistics, BHU, 02-04 Nov 2006
- Online Indexing of Ādiparva of Mahābhārata , co-author with Diwakar Mani, presented by Diwakar Mani at 28th All India Conference of Linguists, Department of Linguistics, BHU, 02-04 Nov 2006
- Regional and Linguistic perspectives on Internationalization of Web: The case of Hindi/Sanskrit, presented at Summit on Internationalization, organized by W3C and MAIT, at hotel Grand Ashoka, Bangalore, August 24-25, 2006
- Towards a Computational Analysis System for Sanskrit, invited paper at The First National Symposium on Modeling and Shallow Parsing of Indian Languages (MSPIL-06) 31st March - 4th April, 2006 at IIT Bombay, appeared in the procs of the symposium
- Computational lexicography and Amarakosha : an online RDBMS approach, National Seminar of Language and Interface, Deptt of Linguistics, Delhi University, March 25, 2006
- Language Technology in India: A survey: November 2005 Issue of C.S.I. magazine
- A Paninian Sandhi analyzer for Sanskrit, co-author with Sachin(presented by Sachin at Platinum Jubilee International Conference, L.S.I. at Hyderabad University, Hyderabad 6-8 dec,2005)
- Morphological analysis of nominal inflections in Sanskrit, co-author with Subash(presented by Subash at Platinum Jubilee International Conference, L.S.I. at Hyderabad University, Hyderabad 6-8 dec,2005)
- Karaka analysis of complicated Sanskrit sentences, co-author with Sudhir K Mishra(presented by Sudhir K Mishra at Platinum Jubilee International Conference, L.S.I. at Hyderabad University, Hyderabad 6-8 dec,2005)
- Developing a Sanskrit Analysis System for Machine Translation, presented at the National Seminar on Translation Today: state and issues, Deptt. of Linguistics, University of Kerala, Trivandrum, August 23-25, 2005
- Socio-psychological motivations for Maithili-Hindi code switching, procs of the AsiaLex conference at National University of Singapore, Singapore, June1-3, 2005
- Information technology applications for Sanskrit lexicography: case of Amarakosha, procs of the AsiaLex conference at National University of Singapore, Singapore, June1-3, 2005
- Devanagari on computer: problems and current status, presented at the Workshop on Hindi support and compatibility for computers at JNU, New Delhi, March31-April1, 2005
- Identifying verb inflections in Sanskrit morphology, (with Sudhir Mishra) proceedings of SIMPLE’04, IIT Kharagpur, 2005
- IT initiatives for Sanskrit in India, accepted for the Sanskrit conference at Bangkok, Thailand, June 23-36, 2005
- Astadhyayi Online, (with Prof. G.V.Singh), Proceedings of International Conference on Speech and Language Technology, New Delhi, Nov,2004, Tata McGraw Hill, 2004
- A karaka analyzer for Sanskrit (with Sudhir Mishra), Proceedings of International Conference on Speech and Language Technology, New Delhi, Nov,2004, Tata McGraw Hill, 2004
- Generating nominal inflectional morphology in Sanskrit, SIMPLE 04, IIT-Kharagpur Lecture Compendium, Shyama Printing Works, Kharagpur, WB, March, 2004
- Optimality re-investigation of Hindi stress system, presented at 30th International conference on Systemic Functional Linguistics, Lucknow, 2003
- LexFace: a lexical interface for English-Hindi translation for share-market, South Asian Language Review , Vol. No. XI. P 52-59, New Delhi
- The system of Panini, Language in India, volume4:2 February 2004
- Current trends in Indian languages technology Language In India, volume 3:12 December 2003
- Jason lives ! The innate horror !! Language in India, volume 3:10 October 2003
- A Prolog Analyzer/Generator for Sanskrit Subanta Padas, Language in India, volume 3:11 November 2003
- English-SQL interface for databases (with Jerry Morgan, head of the dept of linguistics at U of I, Urbana-Champaign) presented at an international conference on SALA held at University of Illinois in July 1999
- Lexical conceptual structure and domain based machine translation system (with Kapil Kapoor, G.V.Singh) presented at a symposium on Machine Aids for translation & communication, JNU, April11-12 1996)
- Proposing a computational system for Nominal Inflectional Morphology in Sanskrit (Proc. of national seminar on “Reorganization of Sanskrit Shastras with a view to prepare their computational database”, January, 1995)
- Indian theory of knowledge: an AI perspective (proc. of national seminar on “Inference Mechanisms in Shastras and Computer Science”, Academy of Sanskrit Research, Malcote, Mysore, April, 1994)
Books
-
Perspectives on the Origin of Indian Civilizations (ed jointly with Angela Marcantonio), DK Printworld, Delhi, 2012
-
Modern Perspectives on Vedanta (ed jointly with Bal Ram Singh, R P Singh, Diwakar Mishra), DK Printworld, Delhi, 2012
-
Computational Semantics for Sanskrit: The Case of Amarakosha Homonyms(co-author with Mukesh Kumar Mishra), Lambert Academic Publisher, Germany, 2012
-
Science and Technology in Ancient Indian Texts (ed. jointly with Bal Ram Singh, Umesh Kumar Singh and Diwakar Mishra), DK Printworld, Delhi, 2011
-
Paninian Morphophonemics: A Computational Approach to Analysis and Interpretation (co-author with Diwakar Mishra), Lambert Academic Publisher, Germany, 2011
-
Primary Derivation in Sanskrit: A Computational Approach to Analysis of Kridanta (co-author with Surjit Kumar Singh),
Lambert Academic Publisher, Germany, 2011
-
-
Sanskrit Computational Linguistics (ed.), proceedings of the 4 th International Sanskrit Computational Linguistics Symposium, Springer Verlag, Germany, 2010 (ISBN 978-3-642-17527-5)
-
Sanganak-janit vyavaharik Sanskrit dhatu-rupavali, Vidyanidhi Prakashan, Delhi, 2007
Forthcoming Books
-
Rasaayana-saara: English Translation of the chapter of dhatu shodhana maarana prakarana, with Umesh Kumar Singh and Diwakar Mishra, Vidyanidhi Prakashana, Delhi, 2012
-
Prachin Sanskrit Kavita – ek jhalak, with Ganga Nath Jha, Vidyanidhi Prakashan, New Delhi (2012)
-
Rasasanketa Kalika: Hindi Translation, with Umesh Kumar Singh and Diwakar Mishra, Vidyanidhi Prakashana, Delhi, 2012
Contributions to Books
-
Maithili bhasha (with Dr Sangeeta Kumari), chapter in the book titled Bharatiya Bhasha Parichay, Kendriya Hindi Nideshalay, Delhi, 2010
-
Hindi-English Dictionary of Common Words (Vyavaharik Angrezi-Hindi kosh), (edit. board), Central Hindi Directorate. Min. of Human Resource Development, 2006
-
Early Monsoon, appeared in “The Splendor of Sunrise” (Jessica Rapisarda ed.), 2001, International library of poetry, Watermark press, One Poetry Plaza, Owings Mills, MD 21117, USA
-
Bhashavijnana paribhasha-kosha, contributed computational linguistics definitions in this volume published by C.H.D., Edn. deptt. of Min. of HRD, 1999
Membership of
Committees
-
Nominated to the Board of Directors (BoD) of World Association for Vedic Studies (WAVES), USA, 2012 onwards
-
Nominated as Institutional member of the Steering Committee of FLaReNet, European Commission project, Pisa, Italy, Oct 2010 onwards
-
Nominated as a member of Board of Studies of Mahatma Gandhi International Hindi University, Wardha
-
Nominated as permanent member in the Expert Committee on CLDR, by the Ministry of Communications and Information Technology (MCIT), Govt. of India
-
Nominated to the Indian Language Technologies and Products Sectional Committee (LITD 20) of Bureau of Indian Standards
-
Nominated member of the PRSG for Speech Technology projects by the Technology Development for Indian Languages (TDIL) Programme of the Ministry of Communications and Information Technology (MCIT), Govt. of India
-
Served as member of the Working Group on Technology Development for Indian Languages (TDIL) Programme of the Ministry of Communications and Information Technology (MCIT), Govt. of India, 2007-2009
-
Nominated member of the Working Group on Localization of Applications & Language Technology Standards, as identified under the National e-Governance Program (NeGP) of Govt of India
-
Nominated member of Expert group on Computational Linguistics by Manufacturer’s Association on Information Technology (MAIT) and Ministry of Communications and Information Technology (MCIT), Govt. of India
-
Nominated member of Managing Committee of Chinmaya Vidyalaya, Vasant Vihar, New Delhi
Organized Seminar/Workshops
-
-
Digitization of India's ancient scripts - Brahmi, Sharada and Grantha at Special Center for Sanskrit Studies, JNU on 20th March, 2012
-
7th ILCI workshop on POS annotation, in collaboration with IIITM Kerala, organized at Trivandrum 9-10 March, 2012
-
6th ILCI workshop on POS annotation, in collaboration with IIT Bombay organized at IIT Bombay 3-4 Jan, 2012
-
-
5th ILCI workshop on POS annotation, in collaboration with Utkal University, Bhubaneswar and ISI Kolkata at Utkal University, Bhubaneswar, 21-23 February, 2011
-
-
E-learning and Multimedia for Education, at Special Center for Sanskrit Studies, JNU, November 3, 2010 (in collaboration with Communication & Information Services, JNU)
-
4th ILCI workshop on POS annotation, in collaboration with Gujrat University, Ahmedabad, 28-30 September, 2010
-
3rd ILCI workshop on multilingual parallel corpora building, in collaboration with IIITM-K Trivandrum at Trivandrum, April 9, 2010
-
-
2nd ILCI workshop on multilingual parallel corpora building, in collaboration with Dravidian University at Kuppam, AP, 28-29 August, 2009
-
1st ILCI workshop on multilingual parallel corpora building, in collaboration with Goa University at Goa, 25-26 March, 2009
-
Meeting On Science & Technology In Ancient Sanskrit Texts in collaboration with Prof. Bal Ram Singh, Director, Indic Studies, Univ Massachusetts, Dartmouth, at Dev Sanskriti Vishva Vidyalaya, Haridvar, 18 June 2009
-
Workshop on Sanskrit E-Learning And Multimedia for school teachers and students of south Delhi public schools, JNU, May 5,2009
-
Panchatantra Painting/Sketching Event And Workshop For Children, 14 November, 2008
-
workshop on Hindi Support And Compatibility For Computers with Hindi Unit of J.N.U., New Delhi, March 31-April 1, 2005
Sessions
chaired
-
Chaired the session titled Vedic culture and epics, in the "International WAVES Conference 2012", University of Massachusetts, Dartmouth, MA, USA, July 14, 2012.
-
Chaired the session titled Sanskrit Vyakarana and Linguistics, in the "International WAVES Conference 2012", University of Massachusetts, Dartmouth, MA, USA, July 14, 2012.
-
Chaired a session in the seminar "Convergence of Science and Spirituality, Nov5, organized by ISSKCON, Sri Venkateswara College, South Campus Delhi University)
-
Chaired a session in the "Symposium on Origin of Indian Civilization", Center of Indic Studies, University of Massachusetts, Dartmouth, 8-9 July, 2011
-
Chaired a session in the "Vedanta and Science Symposium", Center of Indic Studies, University of Massachusetts, Dartmouth, 6-7 July, 2011
-
Chaired a session on "Civilizational Harmony" in the 8th International conference of the WAVES, at Trinidad & Tobago, West Indies, August 5, 2010
-
Chaired a plenary session in the "19th International Vedanta Congress", organized by Center of Indic Studies, University of Massachusetts, Dartmouth, July 29, 2010
-
Chaired a panel discussion on "Sanskrit its evolution and representation in the digital age", in the "19th International Vedanta Congress", organized by Center of Indic Studies, University of Massachusetts, Dartmouth, July 30, 2010
-
Chaired a session in
“International Workshop on Referential Entity Resolution”, organized by AU-KBC,
MIT campus, Anna University, Chennai, Jan15-17,2008
-
Chaired a session on
“Computational Linguistics” in the 28th All India Conference of Linguists,
Department of Linguistics, BHU, on Nov 4, 2006
-
Chaired a session in the
National Seminar on Translation Today: state and issues, Deptt.
of Linguistics, University of Kerala, Trivandrum, August 23-25, 2005
Workshops
- Organized a workshop jointly with Microsoft, USA on "Indian Language Technology and English-Urdu Machine Translation" at Special Center for Sanskrit Studies, JNU, 26-27 July, 2012
- 8th ILCI workshop on POS annotation titled "National Workshop on BIS Annotation Scheme for Indian Languages" in collaboration with IIIT Hyderabad, at Hyderabad 21-23 July, 2012
- Participated in 4 day workshop on Samasa Tagging at Poornaprajna Vidyapeetha, Bangalore, May 16-19, 2009
- Participated in two-day workshop on POS tagging for Sanskrit under the Sanskrit Consortia, organized by Dept. of Sanskrit Studies, University of Hyderabad, Hyderabad, June20-21, 2008
- Participated in a series of workshops on editing Hindi-English-Bodo Dictionary, at Central Hindi Directorate during first quarter of 2008
- International Workshop on Referential Entity Resolution, AU-KBC, MIT campus, Anna University, Chennai, Jan15-17,2008
- Review of speech technologies projects, organized by TDIL, MCIT, CDAC- Mumbai, Dec 12, 2007
- Indic languages tagset design, a one day workshop organized at Microsoft Research India Lab, Bangalore, Dec 2, 2007
- Indic languages tagset design, a four day workshop organized at Microsoft Research India Lab, Bangalore, March15-18, 2007
- Indic languages tagset design, a three day workshop organized at Microsoft Research India Lab, Bangalore, Jan19-21, 2007
- Indic languages tagset design, a two day workshop organized at Microsoft Research India Lab, Bangalore, 2-3 Dec, 2006
- Indic languages tagset design, a two day workshop organized at Microsoft Research India Lab, Bangalore, 28-29 Oct, 2006
- Government @27/7: overcoming contextual & cultural challenges , South Asia e-government summit organized at The Grand, Vasant Kunj, Oct 26, 2006 by DIT, GOI, CDAC, ASIA, OSPA forum
Supervision of
Research Students
Projects
- Shallow Parser Tools (SP Tools) for Indian Languages - Maithili, as Director, funded under the Technology Development for Indian Languages (TDIL) program by the Ministry of Communications and Information Technology (MCIT), March , 2012 onwards
- Indian Languages Corpora Initiative (ILCI), as Director and Consortium Leader, funded under the Technology Development for Indian Languages (TDIL) program by the Ministry of Communications and Information Technology (MCIT), March, 2009 onwards
- Developing Computational Tools For Sanskrit And Sanskrit-Hindi Machine Translation, as Director, funded under the Technology Development for Indian Languages (TDIL) program by the Ministry of Communications and Information Technology (MCIT), March 28, 2008 onwards
- Online Multilingual Amarakosa (2005-2007): as Director: funded by JNU under UPOE program of UGC
- Database system for dialects of Hindi (2002-2003): as co-director: Under my supervision , a team of software engineers developed a VB-SQL server software for dialect mapping of Hindi, sponsored and run by MGAHU, New Delhi
- ALED (Active Living Everyday) at Human Kinetics, USA: a dynamic content, interactive website for fitness management 2002: project manager and lead programmer
- Internationalization project of CollaTech e-solutions using JAVA resource bundles. Project involves making the source code compatible to the multilingual locales and languages, 2001
- Database development project at the University of Illinois’s Beckman Institute for Advanced Science and Technology 1998-1999: database programmer
- Analyzing schizophrenic speech at Dept. of Psychology, University Of Illinois, Urbana-Champaign, 1997: research assistant
- Developing technical terminology for Hindi (attended a series of workshops organized by Scientific and Technical Terminology Commission, Min of HRD, 1994-1996) and contributed in developing technical terminology for Hindi
- Computer Assisted Sanskrit Teaching Learning Environment (DoE sponsored project at JNU 1991-1994): research investigator
Summer Training
Trained students of translation technology from MGAHU, Wardha in Computational linguistics techniques for Machine Translation in 2005
Technical Expertise
- Machine Translation, computational lexicography computational grammar development, grammar analysis, interactive linguistic instruction, software internalization/localization
- Natural Language Interface, Attribute Value Grammar (AVG) development, testing, parsing, algorithm implementation
- Language Technology project management
- B2B e-commerce solutions, test drivers in Java Servlet/JSP, AWT/Swing, Apache/Tomcat, JDBC/SQL Server
- RAD database and web applications, design and development of user interface in the windows/NT environment using Delphi, SQL Server, Interbase
- project management in software development
Awards / Honors
-
Nominated to the Editorial Board of Springer for their journal Language Resource and Evaluation (LRE) since Feb 2010.
- Mukesh and Priti Chatter Distinguished Professor of History of Science at University of Massachusetts Dartmouth, USA since 2009.
- Evaluated as Exceptional Programmer by the HKUSA software director, 2002
- Beckman Institute for Advance Science & Technology, University of Illinois Research Assistantship for advanced studies (Illinois, USA) 1998
- Member of the prestigious Cognitive Science group at the University of Illinois (USA): 1997-1999
- Ranked Excellent Teacher twice by students of Hindi at University of Illinois (USA) :1997,1998
- University of Illinois Urbana-Champaign Teaching Assistantship for teaching Hindi language and linguistics to U.S. students: (Illinois, USA) :1997
- Dept. of Psychology, University of Illinois, Urbana-Champaign Research Assistantship (summer 1997) to study schizophrenic speech (Illinois, USA) 1997
|