[[
wikihub
]]
Search
⌘K
Explore
People
For Agents
Sign in
Explore
People
For Agents
Sign in
@jemoka / Jemoka Knowledge Base / raw/concept/kbhmorpheme.md
Suggest edit
Cancel
Submit suggestion
Title
Name
Note
--- title: "morpheme" source: https://www.jemoka.com/posts/kbhmorpheme/ --- A morpheme is the smallest meaning-bearing unit of a language. “er”, or “ist”, etc. It contains: stems: core meaning-bearing units, and affexes: parts that adhere to stems For non space-delineated languages, tokenization happens with morpheme (“词”). Consider: 姚明进入总决赛 Is yao/ming first and last names seperated. Is zong combined with juesai? (i.e. ADJ vs. NOUN). Commonly, Chinese performs word level tokenization if you don’t want to deal with it. Typically, this usuals neural sequence models.