About
I’m Chihiro Taguchi, a full-time language lover.
I’m a PhD student in Computer Science and Engineering (CSE) at the University of Notre Dame, USA,
as part of the Natural Language Processing group led by Dr. David Chiang.
Quick links
Interests
- Linguistics
- Turkic (Tatar), Quechuan (Kichwa), Japonic (Japanese)
- Syntax (Minimalism, Lexical-Functional Grammar)
- Language documentation
- Anything about languages
- Natural Language Processing & Computational Linguistics
- Speech recognition
- Machine translation
- Language learning
- Mandarin Chinese, Spanish, Italian, Thai
- Let me know if you are interested in language exchange with me.
News
- (July 7, 2024) I completed my first fieldwork in Quito, Ecuador, to research the Kichwa language. The fieldwork involved collecting spoken Kichwa data and analyzing Kichwa morphosyntax.
- (June 21, 2024) I presented my (previously presented) work “Killkan: The Automatic Speech Recognition Dataset for Kichwa with Morphosyntactic Information” at AmericasNLP 2024 hosted at NAACL 2024!
- (May 22, 2024) I presented my work “Killkan: The Automatic Speech Recognition Dataset for Kichwa with Morphosyntactic Information” at LREC-COLING 2024! [Kichwa ASR Model] [Killkan Dataset] [Code]
- (May 16, 2024) Our paper “Language Complexity and Speech Recognition Accuracy: Orthographic Complexity Hurts, Phonological Complexity Doesn’t” was accepted at ACL 2024!
- (May 15, 2024) Our paper “Non-discourse-configurationality in Imbabura Kichwa” has been published in the Proceedings of the Linguistic Society of America. [Paper]
- (April 21, 2024) Our joint work “Japanese Rule-based Grapheme-to-phoneme Conversion System and Multilingual Named Entity Dataset with International Phonetic Alphabet” has been accepted at SIGMORPHON 2024 at NAACL 2024!
- (April 16, 2024) I presented my speech-to-IPA project at the Midwest Speech and Language Days at the University of Michigan, Ann Arbor.
- (April 13, 2024) Alianza Quechua 2024 tantanakuypi, kichwa shimipak “rimashkata killkak anta”ta ruray proyectomanta rimarkanimi. Presenté mi trabajo sobre el reconocimiento automático del habla para el idioma kichwa en la reunión anual de la Alianza Quechua. [Diapositivas]
- (April 2, 2024) I was awarded the Kellogg Institute Graduate Student Research Grant!
- (April 2, 2024) Our joint work “Strategies for the Annotation of Pronominalised Locatives in Turkic Universal Dependency Treebanks” has been accepted at the Joint Workshop on Multiword Expressions and Universal Dependencies at LREC-COLING 2024!
- (March 14, 2024) 言語処理学会2024にて口頭発表しました。スライド
- (February 20, 2024) Our paper “Killkan: The Automatic Speech Recognition Dataset for Kichwa with Morphosyntactic Information” has been accepted at LREC-COLING 2024! [Preprint]
- (February 20, 2024) Our joint work “J-SNACS: Adposition and Case Supersenses for Japanese Joshi” has been accepted at LREC-COLING 2024!
- (January 20, 2024) Our joint project “Unifying the Annotations in Turkic Universal Dependencies Treebank” has been accepted at UniDive 2nd meeting.
- (January 10, 2024) Hice una presentación titulada “El conjunto de datos de reconocimiento automático de voz para kichwa” en el Taller Tecnologías Digitales y Lenguas Indígenas.
- (January 7, 2024) I presented a poster titled “Non-discourse-configurationality in Imbabura Kichwa” at the Linguistic Society of America 2024 Annual Meeting. This is a joint work with Jefferson Saransig.
- (November 17, 2023) Presenté mi trabajo “Reconocimiento automático del habla de kichwa para su inclusión y empoderamiento en el ciberespacio” en el VI Seminario Internacional Revitalizando Ando que tuvo lugar en la Universidad Politécnica Salesiana de Quito, Ecuador.
- (September 16, 2023) I gave an informal presentation in our NLP group’s meeting about my recent research updates in linguistics (syntax). I personally think the idea is cool and exciting, so I share my handout here: [handout]
- (September 8, 2023) I gave a talk “UD-Tatar NMCTT Treebank: Issues in Annotation across Turkic UD” at the Turkic UD Workshop! [Slides]
- (August 22, 2023) I presented a poster “Universal automatic phonetic transcription into the International Phonetic Alphabet” at INTERSPEECH 2023!
- (August 4, 2023) I presented my work “Grammaticalization of modal nominal predicates in Tatar” at the 21st International Conference on Turkish Linguistics! [Slides]
- (July 26, 2023) I gave an online talk titled “Bridging Natural Language Processing and Descriptive Linguistics with Universal Dependencies” at the Field Linguistics Workshop hosted by the Tokyo University of Foreign Studies. My slides are available here (in Japanese).
- (July 23, 2023) I presented my poster “Japanese gapless relativization: The syntax–prosody interface to semantics” at the 28th Annual Lexical-Functional Grammar Conference.
- (July 14, 2023) I presented my poster “Incorporating AI-based Speech Transcription into Language Documentation: A case study of Imbabura Kichwa” at the LSA Institute.
- (June 16, 2023) I attended the LSA Institute at the University of Massachusetts, Amherst, for four weeks, with generous support by the Kellogg Institute.
- (May 15, 2023) I started my fieldwork on the Kichwa language in Quito, Ecuador, with generous support by the Center for the Study of Languages and Cultures. I am preparing language resources for computational and educational use, as well as investigating some interesting grammatical phenomena in Kichwa. Stay tuned!
- (March 16, 2023) My poster presentation Universal Dependencies Japanese with Morphological Features at the Annual Meeting of the Association for Natural Language Processing (NLP2023) received the Committee’s Honorable Mention!
言語処理学会2023の私の発表「形態論情報付きUniversal Dependencies」に対して委員特別賞をいただきました。ご興味を持ってくださった皆さん、ありがとうございました。
- (March 14, 2023) I presented two papers (UD Japanese with morphology and Okinawan UD) at the Annual Meeting of the Association for Natural Language Processing (NLP2023) in Okinawa, Japan.
言語処理学会2023にて二つのポスター発表を行いました。
- (March 11, 2023) I presented my work “Introducing Morphology in Universal Dependencies Japanese” at Universal Dependencies Workshop (UDW2023) held at Georgetown University Round Table (GURT2023).
- (February 3, 2023) I am now a doctoral student affiliate at the Kellogg Institute.
- (January 8, 2023) I presented my project “Building an automatic Speech-to-IPA system” at IEEE SLT-Code Hackathon.
- (January 5, 2023) Happy New Year! My first linguistics paper has just been published in The Proceedings of the LFG’22 Conference.
- (October 29, 2022) I gave an online talk at the 33rd South of England LFG Meeting. (Slides)
- (October 13, 2022) I received my Master’s degree in Linguistics by Research with Distinction from the University of Edinburgh!
My thesis is here (Sorry for silly typos in there).
Notre Dame: c[ my surname ] at nd.edu
![itscalledhymmnosfyi](https://user-images.githubusercontent.com/72488381/213342087-609e6bf7-07c2-4a76-b529-710e34e11c1e.png)