Ref:
- B: begin
- M: middle
- E: end
- S: single
Machine Learning:
- Maximum Entropy Markov
- Conditional Random Field (semi-CRF and linear CRF)
Deep Learning:
- Collobert with char embedding
- Max-Margin Tensor Neural Network(MMTNN)
- Gated recursive neural network(GRNN)
- Long Short Term Memory Neural Networks(LSTM)
- GRNN & LSTM
Ref:
- http://technology.chtsai.org/mmseg/
- http://yongsun.me/2013/06/simple-implementation-of-mmseg-with-python/
Simple: find the longest match
complex: find 3-word chunks, choose the 1st chunk with the maximum length
- maximum matching (59.5%)
- largest average word length (30.6%)
- smallest variance of word lengths (1%)
- largest sum of degree of morphemic freedom of one-character words (9%)
Ref:
Ref: *