Archive for July, 2007
The end of my work on French grammar checking
I have finished my first piece of work on LanguageTool, which consisted
of adapting the tool to French grammar checking. The following summary
presents the end of this work.
You can download the report (Mémoire) and the slides (Soutenance). They
are written in French.
Work on rules
As I explained previously, at the beginning of the French grammar checker
project, Myriam Lechelt worked on An Gramadóir. She wrote many
disambiguation and correction rules. Since An Gramadóir was limited and
not really suited to French, it was abandoned.
During my work, I converted An Gramadóir’s rules to LanguageTool.
Thanks to Marcin Miłkowski, who implemented a disambiguator according
to my instructions, I could import the disambiguation rules as well as
the correction rules. Moreover, I simplified them a lot and considerably
reduced their number thanks to the XML language.
Then I analysed a corpus of mistakes (V. Lucci et A. Millet, …
Report and slides (French grammar checking)
Here are my research report and the slides from my defence.
The work I did is integrated into LanguageTool and can be used in
OpenOffice.org as an extension. The file and the installation
instructions are available on the LanguageTool website.
Other people have started writing new rules for French, to increase
LanguageTool's correction coverage. The rule base therefore evolves
regularly.
There is still a lot of work to do…
(Post originally written by Agnes Souque on the old Nuxeo blogs.)
Summary of the French grammar checker project
I have recently started working on a project for a free French grammar
checker that could be integrated into OpenOffice.org.
Myriam Lechelt initiated this project two years ago by adapting
An Gramadóir, a Gaelic grammar checker developed by Kevin Scannell. But
this tool turned out not to be very suitable for French grammar.
Myriam had also analysed other grammar checkers, among them LanguageTool.
It is a rule-based style and grammar checker initially developed for
English by Daniel Naber, and then extended to other languages such as
German, Polish and Hungarian. Myriam rejected it at the time, but it has
progressed a lot since. So we have decided to work with the new version
for our French grammar checker.
In her work, Myriam gave leads for creating a new grammar checker for
French. For example, she advises segmenting sentences into chunks, an
intermediate level between the sentence and the word.
She also suggests …
Review of the work on the LanguageTool grammar checker
I have finished my first piece of work on LanguageTool, which consisted
of adapting this tool to French grammar checking. Here is a summary that
presents the end of this work.
Work on rules
Thanks to Marcin Miłkowski, who developed a disambiguator according to
my instructions, I was able to convert to the LanguageTool formalism all
the disambiguation and correction rules written for An Gramadóir.
The XML language allowed me to simplify the rules a lot and to reduce
their number by grouping them.
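To illustrate how grouping can reduce the number of rules, here is a minimal Python sketch. It is not LanguageTool's XML formalism or An Gramadóir's format: the rule layout, the names FLAT_RULES, GROUPED_RULE, match_rule and check, and the "ils on" / "elles on" example are all assumptions made for illustration. The idea is only that when each position in a pattern can be a regular expression, one rule can cover several word sequences that would otherwise each need their own rule.

```python
import re

# Flat style (hypothetical format): one rule per exact word sequence.
# Each rule is (token patterns, suggestion for the last token).
FLAT_RULES = [
    (("ils", "on"), "ont"),
    (("elles", "on"), "ont"),
]

# Grouped style: each position is a regular expression, so a single
# rule covers every sequence that the flat rules list one by one.
GROUPED_RULE = ((r"ils|elles", r"on"), "ont")


def tokenize(text):
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"\w+", text.lower())


def match_rule(tokens, patterns):
    """Return the start indices where every pattern fully matches its token."""
    hits = []
    n = len(patterns)
    for i in range(len(tokens) - n + 1):
        window = tokens[i:i + n]
        if all(re.fullmatch(p, t) for p, t in zip(patterns, window)):
            hits.append(i)
    return hits


def check(text, rules):
    """Return sorted (index of the suspect token, suggestion) pairs."""
    tokens = tokenize(text)
    found = []
    for patterns, suggestion in rules:
        for start in match_rule(tokens, patterns):
            found.append((start + len(patterns) - 1, suggestion))
    return sorted(found)
```

In this sketch, check("Ils on mangé et elles on bu.", FLAT_RULES) and check("Ils on mangé et elles on bu.", [GROUPED_RULE]) report the same two suspect tokens, but the second call needs one rule instead of two.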
I then analysed a corpus of mistakes (V. Lucci and A. Millet, 1994,
L’orthographe de tous les jours, enquête sur les pratiques
orthographiques des français, Éditions Champion) in order to extract new
correction rules from it.
LanguageTool currently detects errors:
- of phonetic proximity (confusion of homophones: ça/sa, ont/on,
cet/cette/c’est/ces, etc.)
- of agreement in the noun phrase (in gender and number)
- in the verb group (confusion