Text Preprocessing
Build text preprocessing pipelines optimized for your NLP task.
Usage
Describe your text data and NLP task to get a preprocessing pipeline.
Examples
- "Preprocess tweets for sentiment analysis"
- "Clean and normalize product reviews for topic modeling"
- "Build a text preprocessing pipeline for document classification"
Guidelines
- Choose preprocessing steps based on your downstream task
- Preserve case for sentiment analysis and named entity tasks
- Handle contractions and special characters consistently
- Consider subword tokenization for neural models
- Test how each preprocessing step affects model performance