cwb3
« Back to VersTracker
Description:
Tools for managing and querying large text corpora with linguistic annotations
Type: Formula  |  Latest Version: 3.5.0@0  |  Tracked Since: Dec 17, 2025
Links: Homepage  |  formulae.brew.sh
Category: Other
Tags: linguistics corpus-linguistics nlp text-analysis cqp
Install: brew install cwb3
About:
CWB3 is a powerful suite of tools for managing and querying large text corpora annotated with linguistic data. It provides a high-performance corpus query engine and utilities for building and maintaining corpus databases. Its main value is enabling complex linguistic searches and analysis on massive datasets.
Key Features:
  • High-performance corpus query language (CQP)
  • Efficient storage and indexing of annotated text
  • Command-line tools for corpus creation and management
  • Client-server architecture for remote access
  • Supports complex linguistic queries (e.g., part-of-speech, syntax)
Use Cases:
  • Linguistic research and corpus analysis
  • Natural Language Processing (NLP) dataset creation and validation
  • Digital humanities text analysis
  • Lexicography and dictionary compilation
Alternatives:
  • BlackLab – Java-based server with similar functionality, often used with Solr for indexing.
  • Sketch Engine – Commercial web-based platform offering corpus management and query tools.
License: GPL-2.0-or-later
Dependencies: pcre, glib, readline
Bottles available for: arm64_tahoe, arm64_sequoia, arm64_sonoma, arm64_ventura, arm64_monterey, arm64_big_sur, sonoma, ventura, monterey, big_sur, catalina, arm64_linux, x86_64_linux
Important Notes:
CWB default registry directory: $HOMEBREW_PREFIX/share/cwb/registry
Version History
Detected Version Rev Change Commit
Sep 14, 2025 7:36pm 0 VERSION_BUMP 6fb21800
Jan 7, 2025 12:04pm 0 VERSION_BUMP ff2d27ee
Nov 17, 2024 8:37pm 0 VERSION_BUMP 85dc9baf