LuisMC

Chinese Intelligent Language Learning (CHILL)

Using Grammar Engineering Models for Automated Error Detection

Introduction

This Marie Skłodowska-Curie Action is hosted by Palacký University Olomouc and aims to advance the research in high-precision error detection for Mandarin Chinese. The action has two main goals:

First, this action works towards an in-depth syntactical analysis and computational implementation of Mandarin Chinese Noun Phrases (NPs). It uses an implementable framework (Head-Driven Phrase Structure Grammar, HPSG) to design and test high-precision linguistic analyses for Mandarin Chinese text. The linguistic analysis will pay particular attention to NP quantification, reference/deixis, cognitive status and modification. The computational implementation of theoretical analyses will be made available as part of the Mandarin Resource Grammar.
Second, this action will also work towards exploiting the computational implementations, produced as part of this project, to create a high-precision error detection parser for Mandarin Chinese. To accomplish this, we use a concept known as "mal-rules". Mal-rules are able to detect grammatical errors by allowing violation/relaxation of linguistic constraints. Mal-rules are able to pinpoint, with high precision, the source of language problems, and can be used to correct or provide corrective feedback to ungrammatical sentences.

This action is strongly data-driven, and employs an interdisciplinary methodology, integrating standard methodologies in formal linguistic analysis with those employed in software development, in Computer Science. It is broadly inserted in the context of DELPH-IN — an international consortium that shares a commitment to develop open-source resources for deep linguistic processing using Head Driven Phrase Structure Grammar and Minimal Recursion Semantics.

You can also find more information about this project through CORDIS (EU Reserach Results).

Team

Luis

Luis Morgado da Costa:
Luis is the ‘excellent fellow’ of this action. He is an interdisciplinary researcher with a strong background in computational linguistics. He recently completed his PhD in Using Rich Models of Language in Grammatical Error Detection, from the Interdisciplinary Graduate Program at Nanyang Technological University, Singapore. Luis brings ample expertise in computational linguistics, grammar engineering and in the development of grammatical error detection applications targeting educational contexts.
lmo rgad o.dac ostagmail.com / lui s.mor gadod acostaupol.cz

Joanna's picture

Joanna Ut-Seong Sio:
Joanna is the supervisor of this action. She is a linguist and comedian, originally from Hong Kong. She received her PhD from Leiden University, in the Netherlands, where she worked on modification and reference in the Chinese nominal. Her research interests include Chinese languages, especially in the area of syntax and semantics, as well as the use of verbal arts in the training of communication skills. She brings her unparalleled knowledge concerning syntax of the Chinese NP to this project.
joa nnau tseo ng.sioupol.cz

Hana's picture

Owsiankova Hana:
Hana is the project officer of this action. She is a project manager of national and Horizon Europe projects and mobility coordinator at the Faculty of Arts, Palacký University in Olomouc. Her main interests are science diplomacy and science business. Hana provides invaluable administrative support to the execution of this action.
han a.ows ian kovaupol.cz

Latest Events and Milestones

(2022.11.02-04) Invited talk at the Final Sinophone Conference: "What is Hua?" UPOL, Olomouc, Czech Republic.:
I was invited to participate in a panel on Language Diversity — Promoting Cantonese in and through the Digital World. This panel was led by Joanna Sio, my MSCA supervisor. Joanna and I gave a joint talk titled "The Cantonese Wordnet: Recent Development and Challenges". This panel also included other talks from esteemed colleagues, such as a talk by Andy Chi-on Chin (The Education University of Hong Kong) titled "From Humanities to Digital Humanities: Cantonese Studies in the Big Data Era" and a talk by Zoe Lam and Raymond Pai (University of British Columbia) on "Cantonese Language Curriculum Design for a Diverse Student Population: A Case study of a Canadian University". This was a great opportunity for networking. A joint publication summarizing the discussion we had in this panel (authored by all panelists) is planned for the near future.

(2022.11.03) Invited talk at We Connect 2022. Innovation & Business Sustainability. Ris3ok. Olomouc, Czech Republic.:

I was invited by the Innovation Center of the Olomouc Region to participate in this conference directed at researchers, businesses and policymakers. The conference aimed at discussing various areas of Innovation & Business Sustainability being actively researched/developed in the Olomouc region (where UPOL is situated). I gave a joint (bilingual) talk with Francis Bond and Frantisek Kratochvil. Our talk was titled "Machines that Understand and Teach Us" (Czech title: "Stroje, které nám rozumí a učí nás"). We received a lot of positive attention, and it was a great venue to network.

(2022.11.01) Invited Seminar for the Department of Linguistics and Modern Languages, City University of Hong Kong (CUHK), China.:
I had the honor of being invited to participate in the renowned Seminar Series hosted by the Department of Linguistics and Modern Languages, City University of Hong Kong. In a joint talk with my supervisor, Joanna Sio, we discussed our goals and experience in developing the Cantonese Wordnet Project. We also discussed some key aspects of digital lexicology, such as data formats/curation/sharing in view of sustainability, corpus-based lexicography, and some potential applications of wordnets in education. It was a very productive seminar, gathering support from attendees — many of whom showed interest in future collaborations.

See All Events and Milestones

Technical Details: Grammatical Error Detection

This project uses constraint-based linguistic language models (i.e., computational grammars) to explicitely model common grammatical errors made by learners of Mandarin Chinese. It implements a theoretical concept known as mal-rules to identify and reconstruct ungrammatical sentences with enough precision to perform grammatical error detection, and to provide clear linguistic explanations of why a given sentence is ungrammatical.

Show/Hide Details

EU flag

EU flag

This project has received funding from the European Union’s Horizon 2020 research and innovation programme under the Marie Skłodowska-Curie grant agreement No.101028782.