This is the mail archive of the mailing list for the GCC project.

Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]
Other format: [Raw text]

Re: Repository for the conversion machinery

Richard Biener <>:
> Not sure why we can't label the individual commits with Authors scraped
> from the ChangeLog entries in that commit.  Some commits even have
> multiple authors after all!  And if that fails I'd rather use the
> identity.

Because associating ChangeLog entries with repo commits is really
hard.  You talk as though there's a neat 1-1 mapping with every commit
containing one correctly-written ChangeLog comment.  That's never the
case in the wild, and any plan that assumes it will be is doomed.

I've been to this rodeo before on other GNU projects and the problem
is pretty much AI-complete.  That is, a human can do it relatively easily by
applying contextual knowledge, a computer program can't.

We can't count on the dates to match. There's a whole world of pain
there beginning with the fact that the ChangeLog timestamp and the
commit timestamp can easily be generated across opposite sides
of a top-of-second even if the ChangeLog timestamp was made by a Lisp
hook in Emacs.  And continuing with timezone and DST fooups.

We also can't count on the Subversion username of the commit to match any
address in the ChangeLog comment.  In fact this is the exact problem we
started out trying to solve.
		<a href="";>Eric S. Raymond</a>

Index Nav: [Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav: [Date Prev] [Date Next] [Thread Prev] [Thread Next]