On the (In)Effectiveness of Large Language Models for Chinese Text Correction

07/18/2023
by   Yinghui Li, et al.
0

Recently, the development and progress of Large Language Models (LLMs) have amazed the entire Artificial Intelligence community. As an outstanding representative of LLMs and the foundation model that set off this wave of research on LLMs, ChatGPT has attracted more and more researchers to study its capabilities and performance on various downstream Natural Language Processing (NLP) tasks. While marveling at ChatGPT's incredible performance on kinds of tasks, we notice that ChatGPT also has excellent multilingual processing capabilities, such as Chinese. To explore the Chinese processing ability of ChatGPT, we focus on Chinese Text Correction, a fundamental and challenging Chinese NLP task. Specifically, we evaluate ChatGPT on the Chinese Grammatical Error Correction (CGEC) and Chinese Spelling Check (CSC) tasks, which are two main Chinese Text Correction scenarios. From extensive analyses and comparisons with previous state-of-the-art fine-tuned models, we empirically find that the ChatGPT currently has both amazing performance and unsatisfactory behavior for Chinese Text Correction. We believe our findings will promote the landing and application of LLMs in the Chinese NLP community.

READ FULL TEXT
research
07/08/2023

Evaluating the Capability of Large-scale Language Models on Chinese Grammatical Error Correction Task

Large-scale language models (LLMs) has shown remarkable capability in va...
research
08/03/2023

Does Correction Remain A Problem For Large Language Models?

As large language models, such as GPT, continue to advance the capabilit...
research
10/25/2022

A Chinese Spelling Check Framework Based on Reverse Contrastive Learning

Chinese spelling check is a task to detect and correct spelling mistakes...
research
08/27/2020

Adaptable Filtering using Hierarchical Embeddings for Chinese Spell Check

Spell check is a useful application which involves processing noisy huma...
research
06/28/2023

An Adversarial Multi-Task Learning Method for Chinese Text Correction with Semantic Detection

Text correction, especially the semantic correction of more widely used ...
research
07/11/2023

GujiBERT and GujiGPT: Construction of Intelligent Information Processing Foundation Language Models for Ancient Texts

In the context of the rapid development of large language models, we hav...
research
04/17/2023

Efficient and Effective Text Encoding for Chinese LLaMA and Alpaca

Large Language Models (LLMs), such as ChatGPT and GPT-4, have revolution...

Please sign up or login with your details

Forgot password? Click here to reset