Help

Contextractor extracts clean, readable content from web pages. It strips away navigation, ads, sidebars, and other boilerplate, leaving just the main text. It's built on Trafilatura, an open-source content extraction library that performs well in independent extraction benchmarks.

Choose how you want to use Contextractor:

  • Playground help: configure extraction settings and preview commands
  • CLI help: use the command-line tool for batch processing
  • Docker help: run Contextractor in a container

To extract from multiple URLs automatically, crawl entire websites, or schedule recurring runs, use the Contextractor Apify Actor.

Updated: March 23, 2026