A text processing technique that reduces words to their root by chopping off suffixes or prefixes.
Stemming cuts words down to their root form to keep translations and searches consistent. For example, “running,” “runs,” and “runner” might all be reduced to “run.”
This technique is commonly used in search engines, text mining, and early-stage Natural Language Processing (NLP) pipelines to simplify analysis by consolidating different word forms into a single representation.
In localization, helps manage variations of words to maintain consistency and improve translation memory matches. It simplifies complex word forms, making it easier for translation systems to recognize and handle related words effectively.
Unlike lemmatization, stemming often ignores grammatical correctness, focusing instead on fast and approximate reductions.