This is the mail archive of the libstdc++@gcc.gnu.org mailing list for the libstdc++ project.

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]
Other format:	[Raw text]

Re: Search algorithms in __gnu_cxx::

From: Paolo Carlini <pcarlini at suse dot de>
To: Dhruv Matani <dhruvbird at gmail dot com>
Cc: libstdc++ <libstdc++ at gcc dot gnu dot org>
Date: Fri, 15 Sep 2006 13:48:43 +0200
Subject: Re: Search algorithms in __gnu_cxx::
References: <3a9148b90609061012i22cb2aa3led317334ebbac0a3@mail.gmail.com> <45013E38.5020400@suse.de> <3a9148b90609130546l1d6e644am61984d6c82c36b57@mail.gmail.com> <45080114.200@suse.de> <3a9148b90609130656ofe0d9f3wb5eca188fae22953@mail.gmail.com> <45081067.1000101@suse.de> <3a9148b90609141100k6b98c5a8g1d3011c67bb0e793@mail.gmail.com> <4509C317.2050208@suse.de> <3a9148b90609150009v1595c80oe753a6d68d15b36e@mail.gmail.com> <450A5AA1.2070901@suse.de> <3a9148b90609150440p75effeb0v69cc967d02af79be@mail.gmail.com>

Dhruv Matani wrote:

On this page:
http://www.movsd.com/bm.htm
you can find a complete description of the 3 different kinds of tables
which _may_ be used in any Boyer Moore algorithm implementation. I am
using the Good Character & Bad Character shift tables ONLY. As you can
see from the description, that's perfectly fine to do.

To be clear, it's not about *me*, it's about implementing something well known and of known properties. That means, the pointer above must be part of the documentation, as a comment in the code, at least. Likewise for any other extension we may add in the future.

Also, I think we have to do something for that unordered_map, I don't
think covering the general case via lookups in a map leads to something
close to the original spirit, in any possible sense...

Nope, it doesn't because the algorithm wasn't meant for any search,
but was designed for string searching in particular. Hence the use of
the static tables, etc.... However, using the unordered_map<> as an
approximation to preserve generality will only hurt the performance
and not the correctness. Again, since this is an extension, people
will use it only for strings if we document it accordingly.

No, we don't want to do that. Because using a map like that means that all the complexity figures of the Boyer-Moore algorithm are not valid anymore. Therefore, either we find a better way to deal with that problem (I suggest we don't give up so quickly, no reason to rush) or we have to deliver the algorithm only for chars.

Paolo.

Follow-Ups:
- Re: Search algorithms in __gnu_cxx::
  - From: Dhruv Matani

References:
- Search algorithms in __gnu_cxx::
  - From: Dhruv Matani
- Re: Search algorithms in __gnu_cxx::
  - From: Paolo Carlini
- Re: Search algorithms in __gnu_cxx::
  - From: Dhruv Matani
- Re: Search algorithms in __gnu_cxx::
  - From: Paolo Carlini
- Re: Search algorithms in __gnu_cxx::
  - From: Dhruv Matani
- Re: Search algorithms in __gnu_cxx::
  - From: Paolo Carlini
- Re: Search algorithms in __gnu_cxx::
  - From: Dhruv Matani
- Re: Search algorithms in __gnu_cxx::
  - From: Paolo Carlini
- Re: Search algorithms in __gnu_cxx::
  - From: Dhruv Matani
- Re: Search algorithms in __gnu_cxx::
  - From: Paolo Carlini
- Re: Search algorithms in __gnu_cxx::
  - From: Dhruv Matani

Index Nav:	[Date Index] [Subject Index] [Author Index] [Thread Index]
Message Nav:	[Date Prev] [Date Next]	[Thread Prev] [Thread Next]