About
I'm Tomaž Kovačič - Backend Software Enginner at Zemanta with substantial experience building web based applications on various technology stacks.
My thesis research was dedicated to web content extraction algorithms where I conducted a comprehensive study of existing solutions in this sparse field.
Categories
- text extraction (5)
-
Recent Posts
Tag Archives: comparison
Feature-wise Comparison of HTML Article Text Extractors
In one of my previous posts I compiled quite a decent list of software (and other resources) all capable of extracting article content from an arbitrary HTML document. While I was gathering all the relevant papers and software I kept … Continue reading
Posted in text extraction
Tagged comparison, information retrieval, semantic web, software, text extraction, web api
View Comments