科技回声

8 条评论

peteretep大约 10 年前

Go has some weird syntactic sugar including where a method invocation is rewritten by the compiler to pass in a value or a pointer depending on what the callee wants(!?!). And yet Go code is still littered with:<pre><code> if err != nil { </code></pre> ... rather than some simple, compile-time validated sugar to pass the error value up the call chain. Yes, I've read the justification documents. No, they still don't make a very convincing argument.

评论 #9596055 未加载

评论 #9598212 未加载

rdudekul大约 10 年前

To me goquery seems more intuitive than scrape, may be because I am more familiar with jquery selectors syntax.Any reason why yhat guys (ericchiang) created Scrape (and not use say goquery)?Can you make the matcher function in main.go go away with a simpler (more intuitive) interface/api/dsl?

评论 #9596373 未加载

评论 #9596254 未加载

jwcrux大约 10 年前

I like goquery[1] for doing this type of thing.[1] <a href="https://github.com/PuerkitoBio/goquery" rel="nofollow">https://github.com/PuerkitoBio/goquery</a>

thinxer大约 10 年前

I'd like to introduce htmlutil[1] and cascadia[2] for DOM processing in Go which is useful in scraping articles.[1]: <a href="https://github.com/thinxer/go-htmlutil" rel="nofollow">https://github.com/thinxer/go-htmlutil</a>[2]: <a href="https://github.com/andybalholm/cascadia" rel="nofollow">https://github.com/andybalholm/cascadia</a>

headzoo大约 10 年前

Selfless plug.. May also want to check out Surf for web scraping.<a href="https://github.com/headzoo/surf" rel="nofollow">https://github.com/headzoo/surf</a> Docs: <a href="http://www.gosurf.io/" rel="nofollow">http://www.gosurf.io/</a>Among other things goquery is baked in to easily select page elements using CSS selectors.

chrissnell大约 10 年前

This is very cool. I'm not much of a front-end guy so I'm struggling with the examples. Would you mind posting up a simple example that will scrape--say--the first TD tag of every row of a table? Thanks.

评论 #9595331 未加载

lunixbochs大约 10 年前

Nice! See also <a href="https://github.com/andrew-d/goscrape" rel="nofollow">https://github.com/andrew-d/goscrape</a>

bjblazkowicz大约 10 年前

supporting xpath?

评论 #9595615 未加载

8 条评论

peteretep大约 10 年前

评论 #9596055 未加载

评论 #9598212 未加载

rdudekul大约 10 年前

评论 #9596373 未加载

评论 #9596254 未加载

jwcrux大约 10 年前

I like goquery[1] for doing this type of thing.[1] <a href="https://github.com/PuerkitoBio/goquery" rel="nofollow">https://github.com/PuerkitoBio/goquery</a>

thinxer大约 10 年前

headzoo大约 10 年前

chrissnell大约 10 年前

评论 #9595331 未加载

lunixbochs大约 10 年前

Nice! See also <a href="https://github.com/andrew-d/goscrape" rel="nofollow">https://github.com/andrew-d/goscrape</a>

bjblazkowicz大约 10 年前

supporting xpath?

评论 #9595615 未加载

Scrape: A simple, higher level interface for Go web scraping

8 条评论

Scrape: A simple, higher level interface for Go web scraping

8 条评论