Hacker News new | past | comments | ask | show | jobs | submit login

If JS a problem for you, try Kantu. It works with screenshots and uses OCR for scraping. The beauty is that it works with any kind of site. But clearly, the speed can not match a node.js or perl based scraper (mechanize etc), so it is not suitable for high volumes.



Do you find it better than Phantom?

Just reading about Kantu now. It reminds me of http://www.sikuli.org/


Yeah, the concept is the same as Sikuli, but all inside Chromium (and the OCR is better).

>Do you find it better than Phantom?

It depends. Once you have a working script, web scraping with Phantom is much faster and much more resource efficient. But since Kantu works visually, you do not have to touch any page source code. That makes it much easier/faster to create the automation in the first place, especially for complex sites with date controls, drag & drop and other Javascript.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: