Open Tamil Texts for Machine Processing
This event is a physical and virtual event hosted by the Digital Tamil Studies project at UTSC Library and is part of the Digital Scholarship @UTSC Series.
Date: Saturday, January, 18th, 2020
Time: 9:30 am - 11:30 am [8.00 pm - 10:00 pm IST - India/Sri Lankan Time]
Presenters and discussion may be conducted in Tamil.
Location: The BRIDGE Boardroom: IC 111
University of Toronto Scarborough Campus (UTSC)
Instructional Centre (IC Building), ground floor
1095 Military Trail
Toronto, Ontario M1C 1A4
Zoom link: https://zoom.us/j/299051550
High-quality open Tamil language texts are difficult to source, often requiring substantial cleaning/processing in order to be used for digital scholarship and application development purposes. The Digital Tamil Studies project, based at the UTSC library has been developing partnerships to create better quality Tamil text data for machine processing. This subject is intertwined with many other activities such as text analysis, natural language processing, development of multilingual digital repositories and the Digital Tamil Studies community writ large. Please join us for a roundtable with Tamil computing practitioners and users discussing projects and developments in this area.
- Current State of Open Tamil Datasets - Ravi Annaswamy (Information Architect)
- Python Libraries for Tamil Computing - Muthu Annamalai (Software Engineer)
- Linked and Structured Tamil Data for Machine Learning - Saatviga Sudhahar (Machine Learning Scientist)
- Tamil Computing Needs for Libraries - Natkeeran L. Kanthan (Applications Developer)
Presentations will be followed by discussions.
Tamil Scholars and Computing Experts
Visiting UTSC for the first time? A campus map and information on public transportation and visitor parking can be found at https://www.utsc.utoronto.ca/home/visiting-utsc.
- Saturday, January 18, 2020
- 9:30am - 11:30am
- The Bridge, UTSC