TE
TechEcho
Home24h TopNewestBestAskShowJobs
GitHubTwitter
Home

TechEcho

A tech news platform built with Next.js, providing global tech news and discussions.

GitHubTwitter

Home

HomeNewestBestAskShowJobs

Resources

HackerNews APIOriginal HackerNewsNext.js

© 2025 TechEcho. All rights reserved.

MyHTML – HTML Parser on Pure C with POSIX Threads Support

123 pointsby yxlxabout 9 years ago

7 comments

leeoniyaabout 9 years ago
&gt; By the way, SCRIPT tag tokenization is a hell of an effort. I had to draw a <i>graph</i> [...] Next in turn are the CSS parser and Renderer.<p>CSS parsing should be ok, but layout computation is hard, especially with all the latest specs. The graph presented in the article will be the size of a postage stamp on an aircraft carrier deck.<p>Take a look at the Cassowary constraint solver, btw: <a href="http:&#x2F;&#x2F;overconstrained.io&#x2F;" rel="nofollow">http:&#x2F;&#x2F;overconstrained.io&#x2F;</a><p>&gt; I&#x27;m writing them all by myself, still full of energy.<p>I wish the author the best of luck.
评论 #11271421 未加载
scrollawayabout 9 years ago
This is incredibly clean code. Large, long-term single-person hobby projects make for some kickass codebases. Well done.
lxeabout 9 years ago
Amazing work. How does this compare (in terms of speed mostly) to Google&#x27;s gumbo parser?
评论 #11271592 未加载
评论 #11271489 未加载
legulereabout 9 years ago
<a href="https:&#x2F;&#x2F;github.com&#x2F;servo&#x2F;html5ever" rel="nofollow">https:&#x2F;&#x2F;github.com&#x2F;servo&#x2F;html5ever</a> seems to also have a (not yet complete) C API
mablaeabout 9 years ago
Putting &quot;my&quot; in front of anything should be forbidden.<p>Just &quot;my&quot; two cents.
评论 #11272681 未加载
agumonkeyabout 9 years ago
Interesting to see, just took handmade xml parser as a personal challenge, in python though, I&#x27;ve been hitting nasty performance issues compared to libxml2.
评论 #11273370 未加载
chris_wotabout 9 years ago
I wonder how easy it would be too adapt the API to a set of C++ classes?