Elements and features Pinyin input method




1 Elements and features

1.1 Conversion length
1.2 Treatment of tones
1.3 Treatment of extended Latin characters (ü and ê)
1.4 Treatment of hm, hng, ng, n
1.5 Usage statistics and user dictionaries
1.6 Abbreviation
1.7 Fuzzy pinyin
1.8 Word prediction
1.9 Double pinyin
1.10 Typo correction
1.11 Language mixing





Elements and features

Pinyin input methods differ in a number of aspects. They provide convenience features to speed up input, and some of these features can speed up typing immensely.


Conversion length

The basic idea of an input method is to have a buffer that holds user input until it is converted into characters that are otherwise unavailable on the keyboard.


In the most basic systems, one character is converted at a time. This makes for a time-consuming input process. Not only does the user have to select characters one at a time, but the input system also cannot prioritize character choices using word phrases, grammatical structure, or context. In addition, since the input method supports only one character at a time, it requires the user to type out the full pinyin spelling to narrow down the selection. This kind of system still exists in embedded applications such as cell phones.


Common pinyin implementations on the computer today can hold a clause in pinyin before requiring conversion. This method attempts to guess the appropriate characters using a word-phrase dictionary, grammatical structure, and context.
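As a rough illustration of clause-level conversion, the sketch below converts a buffer of syllables by always preferring the longest phrase found in a dictionary. The `PHRASES` table and the `convert` function are hypothetical names invented for this example, and greedy longest-match is a simplification: real input methods also weigh grammar and context, which this sketch does not attempt.

```python
# Hypothetical toy phrase dictionary: tuples of pinyin syllables -> characters.
PHRASES = {
    ("shang", "hai"): "上海",
    ("shang",): "上",
    ("hai",): "海",
    ("ren",): "人",
}

def convert(syllables):
    """Greedy longest-match conversion of a buffered list of syllables."""
    out = []
    i = 0
    while i < len(syllables):
        # Try the longest phrase starting at position i first.
        for j in range(len(syllables), i, -1):
            key = tuple(syllables[i:j])
            if key in PHRASES:
                out.append(PHRASES[key])
                i = j
                break
        else:
            # Unknown syllable: pass it through unconverted.
            out.append(syllables[i])
            i += 1
    return "".join(out)
```

With this table, `convert(["shang", "hai", "ren"])` prefers the two-syllable phrase 上海 over the single characters 上 and 海.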


Treatment of tones

Chinese is a tonal language. Tones can be used to further distinguish characters that have the same sound. Many of the single-character pinyin method implementations required the input of tones in order to narrow down the character selection.


For the sake of convenience, tone selection is disabled by default in modern pinyin systems on the computer. The user may have the option to enable it, depending on the pinyin implementation.
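A minimal sketch of optional tone input follows. It assumes the tone is typed as a trailing digit (1 to 4, with 5 for the neutral tone), which is a common convention but not one the text above specifies. The `CANDIDATES` table and the `lookup` function are hypothetical.

```python
# Hypothetical toy table: toneless syllable -> list of (character, tone) pairs.
CANDIDATES = {
    "ma": [("妈", 1), ("麻", 2), ("马", 3), ("骂", 4), ("吗", 5)],
}

def lookup(typed):
    """Return candidates for input such as 'ma' (any tone) or 'ma3' (third tone)."""
    if typed and typed[-1] in "12345":
        syllable, tone = typed[:-1], int(typed[-1])
    else:
        syllable, tone = typed, None
    matches = CANDIDATES.get(syllable, [])
    # Without a tone digit every candidate is kept; with one, the list narrows.
    return [ch for ch, t in matches if tone is None or t == tone]
```

Typing the tone is optional here: `lookup("ma")` returns all five candidates, while `lookup("ma3")` narrows the list to 马.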


Treatment of extended Latin characters (ü and ê)

With the exception of tone marks, there are two extended Latin vowels in pinyin: ü (u-umlaut) and ê (e-circumflex). Given that the US keyboard layout is the common keyboard layout in China, a pinyin method implementation needs to be able to facilitate the input of these vowels on such a keyboard.


Since the letter v is unused in Mandarin pinyin, it is almost universally used as an alias for ü. For example, typing nv into the input method brings up the candidate list for the pinyin nǚ.
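The v alias can be sketched as a normalization step applied before the dictionary lookup. The function names and the small `CANDIDATES` table below are hypothetical and only serve to illustrate the idea.

```python
def normalize_umlaut(typed):
    """Map the keyboard alias v to the pinyin vowel ü."""
    return typed.replace("v", "ü")

# Hypothetical toy candidate table keyed by toneless pinyin.
CANDIDATES = {"nü": ["女"], "lü": ["绿", "旅"]}

def lookup(typed):
    """Look up candidates after resolving the v alias."""
    return CANDIDATES.get(normalize_umlaut(typed), [])
```

Because v never occurs in Mandarin pinyin, the replacement cannot collide with a legitimate spelling.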


The handling of ê is not universal, since the character 欸 is the only commonly used character with this pronunciation. It is an interjection equivalent to "eh" in English. Some IMEs, such as Google Pinyin, merge it with e, while others create an additional letter combination for it, such as ea or eh, or ei in iOS. Others simply drop the sound.


Treatment of hm, hng, ng, n

The character 嗯 (ng) can (or should) be written in IBus on Linux and in the Microsoft input method by typing en.


Usage statistics and user dictionaries

Most modern input method implementations adjust the positions of word candidates in the candidate list based on prior usage statistics. In addition, some input methods support user-defined phrases via a user dictionary.
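One simple way to reorder candidates by prior usage is to count how often each candidate has been selected and sort by that count. The `CandidateRanker` class below is a hypothetical sketch of this idea, not the scheme of any particular input method.

```python
from collections import Counter

class CandidateRanker:
    """Reorders candidates so that frequently selected ones come first."""

    def __init__(self):
        self.usage = Counter()

    def record_selection(self, word):
        """Remember that the user picked this candidate."""
        self.usage[word] += 1

    def rank(self, candidates):
        # sorted() is stable, so candidates with equal counts keep
        # their original dictionary order.
        return sorted(candidates, key=lambda w: -self.usage[w])
```

After the user picks 市 once, `rank(["是", "事", "市"])` moves 市 to the front and leaves the other two in their original order.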


Abbreviation

The abbreviation feature allows the user to omit all but the first or first couple of letters of each syllable in the pinyin spelling. This feature can speed up the input of long word phrases significantly. Under this feature, the user can enter the word "concert" (simplified Chinese: 音乐会; traditional Chinese: 音樂會; pinyin: yīnyuèhuì) by typing yyh as opposed to yinyuehui.


In systems that support user-defined phrases, users can define their own abbreviations, which might not follow standard pinyin rules.
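Abbreviation matching can be sketched by comparing the typed letters against the first letter of each syllable of every dictionary word. The `WORDS` table and the function names are hypothetical. The sketch uses only the first letter of each syllable, so it ignores two-letter initials such as zh, ch, and sh.

```python
# Hypothetical toy dictionary: word -> its pinyin syllables.
WORDS = {
    "音乐会": ["yin", "yue", "hui"],
    "音乐": ["yin", "yue"],
    "医院": ["yi", "yuan"],
}

def initials(syllables):
    """Join the first letter of every syllable, e.g. yin yue hui -> yyh."""
    return "".join(s[0] for s in syllables)

def match_abbreviation(typed):
    """Return the words whose syllable initials equal the typed letters."""
    return [word for word, syl in WORDS.items() if initials(syl) == typed]
```

Shorter abbreviations are more ambiguous: in this table yyh matches only 音乐会, while yy matches both 音乐 and 医院.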


Fuzzy pinyin

Pinyin was created based on the pronunciation of Standard Chinese, a variety of Mandarin Chinese. Regional accents are prevalent in Mandarin among both native and nonnative speakers. This means that a significant number of Mandarin speakers have trouble distinguishing a number of similar-sounding syllables of pinyin, such as c and ch, s and sh, z and zh, n and ng, h or hu and f, or n and l. The fuzzy pinyin or fuzzy input (模糊音) feature allows the user to input similar-sounding vowels or consonants as if they were the same thing. This has the disadvantage that the user must choose the correct characters or words from a longer list of homophones.
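Fuzzy matching can be sketched by reducing every syllable to a canonical key in which the confusable sounds collapse together, then comparing keys instead of spellings. The functions below are hypothetical and cover only some of the pairs listed above (z/zh, c/ch, s/sh, n/l, and final n/ng). The h or hu and f pair is left out for brevity.

```python
def fuzzy_key(syllable):
    """Collapse confusable sounds so that similar syllables share one key."""
    s = syllable
    # Retroflex initials are treated like their plain counterparts.
    for retroflex, plain in (("zh", "z"), ("ch", "c"), ("sh", "s")):
        if s.startswith(retroflex):
            s = plain + s[len(retroflex):]
            break
    # Initial l is treated like initial n.
    if s.startswith("l"):
        s = "n" + s[1:]
    # Final ng is treated like final n.
    if s.endswith("ng"):
        s = s[:-1]
    return s

def fuzzy_match(typed, dictionary):
    """Return characters of every syllable whose fuzzy key matches the input."""
    key = fuzzy_key(typed)
    return [ch for syl, chars in dictionary.items()
            if fuzzy_key(syl) == key
            for ch in chars]
```

The longer homophone list mentioned above is visible here: with entries for both si and shi in the dictionary, typing si returns the characters of both syllables.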


Word prediction

Word prediction (simplified Chinese: 联想; traditional Chinese: 聯想; pinyin: liánxiǎng; literally: "association") is a feature of an input method that attempts to guess the next series of characters the user is attempting to enter. The term is used to refer to two different mechanisms that have similar functions.


One of these mechanisms is akin to an auto-complete function for user input. While the user is typing the appropriate pinyin, the input method takes the input and offers possible word phrases that might match it, even though the input is incomplete. For example, when the user enters shang, the input method shows 上海 (Shanghai) as a word candidate under this feature.
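The auto-complete mechanism can be sketched as a prefix search over the pinyin spellings in a dictionary. The `WORDS` table and the `complete` function are hypothetical. Listing shorter spellings first is a design choice made for this example only.

```python
# Hypothetical toy dictionary: full pinyin spelling -> word.
WORDS = {"shanghai": "上海", "shangwu": "上午", "shang": "上", "shan": "山"}

def complete(prefix):
    """Return words whose spelling starts with the (possibly incomplete) input."""
    spellings = [s for s in WORDS if s.startswith(prefix)]
    # Shorter spellings first, so that an exact match leads the list.
    return [WORDS[s] for s in sorted(spellings, key=len)]
```

For the input shang, the result includes 上海 even though its full spelling has not been typed yet.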


The second possible mechanism is the prediction of the user's next input after the user finishes entering a set of words. Continuing the example above, after the user selects 上海 (Shanghai) from the word candidate list, the input method's pinyin buffer is empty. Under this mechanism, the input method displays a list of words that commonly follow the word Shanghai, such as 人 (people), 市 (city), and 的 (an auxiliary word).
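Next-word prediction of this kind can be sketched with counts of which word follows which in previously seen text. The helper names and the tiny corpus below are hypothetical. Real input methods presumably rely on much larger statistics.

```python
from collections import Counter, defaultdict

def build_followers(corpus):
    """Count, for every word, which words follow it in the corpus."""
    followers = defaultdict(Counter)
    for sentence in corpus:
        for prev, nxt in zip(sentence, sentence[1:]):
            followers[prev][nxt] += 1
    return followers

def predict_next(followers, word, limit=3):
    """Return up to `limit` words that most often follow `word`."""
    counts = followers.get(word, Counter())
    return [w for w, _ in counts.most_common(limit)]

# Hypothetical toy corpus of already segmented sentences.
corpus = [["上海", "市"], ["上海", "人"], ["上海", "市"], ["上海", "的"]]
followers = build_followers(corpus)
```

With this corpus, `predict_next(followers, "上海")` lists 市 first because it follows 上海 most often.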


Double pinyin

[Figure: the default double pinyin scheme in Microsoft Pinyin IME. Many IMEs, including ibus-pinyin, support this scheme.]


Vowel groups in pinyin can be up to four letters long. Double pinyin (双拼) is a method whereby the longer vowel groups are assigned to consonant keys as shortcuts, and zh, ch, and sh are assigned to vowel keys as shortcuts. Thus, when the input method expects a vowel, the user can use the shortcuts to speed up typing.


In Microsoft Pinyin IME, for example, if a user wants to input "中华人民共和国 (zhōnghuárénmíngònghéguó)", the People's Republic of China, into a computer, they need to type zhonghuarenmingongheguo in full pinyin. In double pinyin, however, one need only type vshwrfmngshego (v=zh, s=ong, h=h, w=ua, r=r, f=en, m=m, n=in, g=g, s=ong, h=h, e=e, g=g, o=uo).
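The decoding of this example can be sketched as follows: every syllable is exactly two keys, the first standing for the initial and the second for the final. The tables below contain only the key assignments quoted in the example above, so they are a partial scheme, and the function name is hypothetical.

```python
# Partial tables, limited to the mappings quoted in the example.
INITIALS = {"v": "zh"}  # every other initial key stands for itself
FINALS = {"s": "ong", "w": "ua", "f": "en", "n": "in", "e": "e", "o": "uo"}

def decode_double_pinyin(keys):
    """Expand double pinyin keystrokes into full pinyin syllables."""
    if len(keys) % 2:
        raise ValueError("double pinyin uses exactly two keys per syllable")
    syllables = []
    for i in range(0, len(keys), 2):
        initial_key, final_key = keys[i], keys[i + 1]
        initial = INITIALS.get(initial_key, initial_key)
        final = FINALS[final_key]  # KeyError for keys outside the partial table
        syllables.append(initial + final)
    return syllables
```

Applied to vshwrfmngshego, this yields the seven syllables zhong, hua, ren, min, gong, he, and guo, so 14 keystrokes replace the 23 letters of the full spelling.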


Typo correction

Similar to automatic typo correction for English in word processors, some pinyin method implementations can recognize possible typos and show the appropriate word candidates. Using Google Pinyin as an example, when encountering a suspected typo, Google Pinyin shows both the word candidates that assume the input is correct and the word candidates that assume a typo.
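One simple form of typo detection is to check whether swapping two adjacent letters turns the input into a valid syllable. The sketch below illustrates only that idea. It is not Google Pinyin's actual algorithm, and the `VALID` set and the function name are hypothetical.

```python
# Hypothetical toy set of valid pinyin syllables.
VALID = {"shi", "zhong", "guo", "ni", "hao"}

def suggestions(typed):
    """Return two lists: readings of the input as typed, and corrected readings."""
    as_typed = [typed] if typed in VALID else []
    corrected = []
    for i in range(len(typed) - 1):
        # Swap the letters at positions i and i + 1.
        swapped = typed[:i] + typed[i + 1] + typed[i] + typed[i + 2:]
        if swapped in VALID and swapped != typed and swapped not in corrected:
            corrected.append(swapped)
    return as_typed, corrected
```

Returning both lists mirrors the behaviour described above, where candidates for the input as typed and for the suspected correction are shown together.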


Language mixing

Most advanced pinyin method implementations allow English to be mixed into the input stream without requiring the user to change the language mode. However, this comes with limitations, such as requiring the English input to be uppercase.


The following examples show the difference if a user wishes to enter 这个SQL漏洞可以瘫痪整个系统。 (This SQL vulnerability can paralyze the entire system.):



zhe ge [switch to English] SQL [switch to Chinese] loudong keyi tanhuan zhengge xitong. (unsupported)
zhe ge SQL loudong keyi tanhuan zhengge xitong. (supported)
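The supported case can be sketched by treating uppercase tokens as English and passing them through unchanged, which matches the uppercase limitation mentioned above. The `PINYIN` table and the `convert_mixed` function are hypothetical, and the input is assumed to be separated by spaces.

```python
# Hypothetical toy dictionary: pinyin spelling -> word.
PINYIN = {
    "zhe": "这", "ge": "个", "loudong": "漏洞", "keyi": "可以",
    "tanhuan": "瘫痪", "zhengge": "整个", "xitong": "系统",
}

def convert_mixed(text):
    """Convert pinyin tokens and keep uppercase (English) tokens as typed."""
    out = []
    for token in text.split():
        if token.isupper():
            # Uppercase input is treated as English and left untouched.
            out.append(token)
        else:
            # Unknown lowercase tokens are also passed through unchanged.
            out.append(PINYIN.get(token, token))
    return "".join(out)
```

No mode switch is needed in this sketch because the letter case alone decides which tokens are converted.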









