Subject: [boost] [gsoc 2013] Approximate string matching
From: Jan Strnad (hanny.strnad_at_[hidden])
Date: 2013-04-12 05:52:56
Hi, I'm Jan Strnad, currently a Master's student at CTU in Prague, Czech
Republic. This summer, I would like to participate in GSOC and I believe
that approximate string matching is a good match for me, since my
background i this area is quite strong.
Now more specifically. I would like to implement approximate string
matching algorithm(s) based on the NFA. I would like to support various
approximate distances, such as Hamming distance , Levenstein distance
, Damerau-Levenstein distance , Delta distance, Gamma distance and
finally (delta, gamma) distance. Sorry no explanation of the last three
found on the Web, but I can provide one if interested.
In the matter of NFA implementation, I would probably choose dynamic
programming approach, but I'm thinking of shift-or algorithm as well.
Can I ask you for any kind of thoughts or feedback regarding this topic?