This is the mail archive of the
gcc@gcc.gnu.org
mailing list for the GCC project.
Re: List of typos.
- From: Ondřej Bílka <neleai at seznam dot cz>
- To: Jonathan Wakely <jwakely dot gcc at gmail dot com>
- Cc: "gcc at gcc dot gnu dot org" <gcc at gcc dot gnu dot org>, Veres Lajos <vlajos at gmail dot com>
- Date: Sat, 6 Jul 2013 07:44:17 +0200
- Subject: Re: List of typos.
On Fri, Jul 05, 2013 at 05:17:54PM +0100, Jonathan Wakely wrote:
> On 5 July 2013 16:43, Ondřej Bílka wrote:
> >
> > Hi, I ran aspell on comments in gcc. After a bit of cleaning, a list with
> > frequencies is here. It is still relatively noisy and more heuristics
> > are needed.
> >
> > http://kam.mff.cuni.cz/~ondra/gcc_misspells
> >
> > What will we do with this now?
>
> It doesn't look very useful yet, clearly "namespace" and "param" are not errors.
We need to teach aspell about these. I am thinking about creating a shared
wordlist that gcc developers will use. It is mainly a logistics problem; I
could imagine having a shared file on sourceware and using a script like
this:
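scp remote_wordlist wordlist    # Fetch the shared wordlist.
aspell merge english wordlist   # Combine it with the English dictionary.
aspell -m wordlist -p new       # Check with it; accepted words go to the personal list "new".
scp remote_wordlist wordlist    # Fetch again, to decrease race conditions.
aspell merge wordlist new       # Fold the new words into the fresh copy.
scp wordlist remote_wordlist    # Upload the result.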
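The merge step on its own could look roughly like the sketch below. It assumes the lists are plain text with one word per line, which is not the format of aspell's own personal dictionaries, and the function name merge_wordlists is only an illustration.

```shell
#!/bin/sh
# Hypothetical helper, not part of aspell: merge two plain-text
# wordlists (one word per line) into a sorted list without duplicates.
merge_wordlists() {
    # $1 and $2 are the input lists; the merged list goes to stdout.
    # LC_ALL=C keeps the sort order independent of the user's locale.
    LC_ALL=C sort -u -- "$1" "$2"
}
```

For example, `merge_wordlists wordlist new > merged` would write the combined list to a file named merged.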
>
> "acccepted" and "accestor" and "actullay" are real spelling mistakes,
> but someone will have to do a grep through the whole tree to see where
> they come from, and then ignore all the ones in ChangeLog files.
If I could extract the score that aspell uses to rank its candidates, I could
sort the words starting with the most likely misspellings. I wrote to
aspell-user but have had no response yet.
The list covers only comments, not ChangeLogs.
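For tracking down where a confirmed misspelling comes from, a grep over the tree that skips ChangeLog files could be sketched as follows. This assumes GNU grep (for -r and --exclude); the function name find_typos is only an illustration.

```shell
#!/bin/sh
# Hypothetical helper: print file:line:text for each whole-word
# occurrence of the given misspellings under a directory, skipping
# files whose name starts with "ChangeLog".
# Assumes GNU grep for the -r and --exclude options.
find_typos() {
    dir=$1
    shift
    for word in "$@"; do
        # -F: fixed string, -w: whole words only, -n: show line numbers.
        grep -rnwF --exclude='ChangeLog*' -e "$word" "$dir"
    done
}
```

A call such as `find_typos gcc acccepted accestor actullay` would then list the places to fix.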
Ondra