TechEcho

7 comments

dangover 5 years ago

A thread from 2016: <a href="https://news.ycombinator.com/item?id=12199836" rel="nofollow">https://news.ycombinator.com/item?id=12199836</a>2013: <a href="https://news.ycombinator.com/item?id=5672875" rel="nofollow">https://news.ycombinator.com/item?id=5672875</a>2011: <a href="https://news.ycombinator.com/item?id=2723366" rel="nofollow">https://news.ycombinator.com/item?id=2723366</a>Maybe others?p.s. these links are just for curious readers; reposts are ok after a year or so—see <a href="https://news.ycombinator.com/newsfaq.html" rel="nofollow">https://news.ycombinator.com/newsfaq.html</a>.

adriantamover 5 years ago

This is chapter 1 of "Beautiful Code" (<a href="http://shop.oreilly.com/product/9780596510046.do" rel="nofollow">http://shop.oreilly.com/product/9780596510046.do</a>)

评论 #22319280 未加载

glangdaleover 5 years ago

These posts are great for history, but regular expression implementation has moved on considerably from early Thompson implementations whether backtracking or NFAs. There is a considerable body of literature about regex implementation, including many quite convincing implementations and alternate formulations, some of which weren't even done by people at, or from, Bell Labs!We seem to have something of an Eternal September of regex implementation knowledge (abetted by Russ Cox's amazingly selective bibliography in his posts introducing RE2).

hstaabover 5 years ago

I saw this in the repo of single file language implementations that was posted yesterday. Glad to see it on the front page now.Here’s the repo if anyone is interested in checking out the others:<a href="https://github.com/marcpaq/b1fipl" rel="nofollow">https://github.com/marcpaq/b1fipl</a>

jstimpfleover 5 years ago

Skimming over it, the implementation looks inefficient:<pre><code> int matchstar(int c, char *regexp, char *text) { do { /* a * matches zero or more instances */ if (matchhere(regexp, text)) return 1; } while (*text != '\0' && (*text++ == c || c == '.')); return 0; } </code></pre> That can be used to match short strings, but not to grep through a filesystem. A good regex matcher has runtime O(N*M) where N is the length of the regex (typically very short) and M is the length of the scanned text.

评论 #22322292 未加载

评论 #22326168 未加载

raverbashingover 5 years ago

Seeing code like this I can understand why the pioneers though C code could be pretty.But now all I see there is something more akin to a demonstration of ice carving with chainsaws than something that should be used in a production system.

leohover 5 years ago

This would be a very good technical interview question — i.e. "implement this simple regex specification."

评论 #22319670 未加载

评论 #22320963 未加载

评论 #22321566 未加载

评论 #22319332 未加载

7 comments

dangover 5 years ago

adriantamover 5 years ago

This is chapter 1 of "Beautiful Code" (<a href="http://shop.oreilly.com/product/9780596510046.do" rel="nofollow">http://shop.oreilly.com/product/9780596510046.do</a>)

评论 #22319280 未加载

glangdaleover 5 years ago

hstaabover 5 years ago

jstimpfleover 5 years ago

评论 #22322292 未加载

评论 #22326168 未加载

raverbashingover 5 years ago

leohover 5 years ago

This would be a very good technical interview question — i.e. "implement this simple regex specification."

评论 #22319670 未加载

评论 #22320963 未加载

评论 #22321566 未加载

评论 #22319332 未加载

A Regular Expression Matcher (2007)

7 comments

A Regular Expression Matcher (2007)

7 comments