|
sentencepiece
☆
« Back to VersTracker
|
||||||||||
|
Description: Unsupervised text tokenizer and detokenizer |
||||||||||
| Type: Formula | Tracked Since: Dec 28, 2025 | ||||||||||
| Links: Homepage | formulae.brew.sh | ||||||||||
| Category: Ai ml | ||||||||||
| Tags: nlp tokenization machine-learning bpe ai | ||||||||||
| Install: brew install sentencepiece | ||||||||||
|
About: SentencePiece is an unsupervised text tokenizer and detokenizer primarily for Neural Network-based text processing systems. It implements subword segmentation algorithms like BPE and Unigram, enabling efficient handling of large vocabularies and out-of-vocabulary words. This library is essential for training modern NLP models such as BERT and T5. |
||||||||||
Key Features:
|
||||||||||
Use Cases:
|
||||||||||
Alternatives:
|
||||||||||
| Version History | ||||||||||
|