Service Packages

If some sources are particularly important to you, we offer additional manual services to further ensure your continuous delivery of news data.

Service Packages

If some sources are particularly important to you, we offer additional manual services to further ensure your continuous delivery of news data.

Priority sources

Every weekday
We monitor the daily volume of articles to detect deviations. If we see an unusual deviation, we will review our crawling configuration to make sure that everything is working correctly.

Every week
We screen the Google Index to determine whether Google has found articles that we have not. If that is the case, we will refine our crawling configuration to catch those articles.

CAPTCHA bypass

In rare cases our crawler is blocked by CAPTCHA tests. We offer a manual CAPTCHA bypass service for customers who need all articles from blocked sources.

The price of service is determined by the number of articles blocked by CAPTCHA.

Paywall content

We can deliver content from behind paywalls, if:

  1. The customer has signed an agreement with the publisher, and
  2. The customer provides Opoint with login credentials.

We monitor the functionality of the login credentials at all time. Content which is fetched using a partiular login will be delivered exclusively to the customer who provided the login.

Daily manual quality checks

For sites in service packages we

  • look for quality issues
  • have lower thresholds for assessing something as a probable quality issue
  • compare our article harvest with what sites have published whenever a probable issue is detected
  • take appropriate actions whenever the comparison shows quality issues

To find probable issues on each site we look at:

  • changes in daily publishing volumes to detect sites where we miss articles
  • downloading failures to detect sites where articles are found but not published by us
  • changes in daily average article lengths to detect sites where we read too little from article pages
  • common article titles, text snippets, and text tails to detect sites where we read outside article borders

Contact us

Do you want to know more, please contact us for a chat about your challenges and needs.

Download