massOfai

WordPiece

NLP & Text

Tokenizer algorithm used by BERT

What is WordPiece?

Builds subword vocabulary optimizing for likelihood; used in BERT and other models for robust tokenization.

Real-World Examples

  • BERT tokenization pipeline

Related Terms

Learn more about concepts related to WordPiece