c# - Convert language characters to Latin alphabet -
i try program application learn foreign characters. if example want learn japanese, you'd have memorize hiragana, katakana , kanji letters. (e.g あ、い、か... = hiragana カ、サ、ケ... = katakana 本、学... = kanji).
example: user trying learn japanese. has learn: か = ka
本 = hon, meaning: basis/ book/ this
and has learn pronunciation.
my first question if there library or in .net? looked @ microsoft ime, couldn't find out how use in project.
i looked @ unicode database , it's possible it. managed write project convert か ka. thing missing meanings (for example 本=basis/ book/ this), provided unicode database. unfortunately couldn't find them in .xml file ucd data. works, when enter word on ucd-website.
another approach use cldr-library seems related ucd. unfortunately couldn't figure out of 2 (ucd or cldr) should use. cldr: http://cldr.unicode.org/
my question if ucd best way , if use cldr.
i don't want work normal lists type in characters myself. take time, kanji letters (more 10,000).
thanks
edit: solved it, extract information unicode character database (ucd). can download whole database in .xml file. needed learn how handle , find correct attributes.
both google , microsoft offer apis can call translate text. eg http://www.microsoft.com/en-us/translator/translatorapi.aspx
depending type of service choose small fee might required. offer sounds translation. no need re-invent wheel. :-)
if codepage type question. blog amusing place start http://www.joelonsoftware.com/articles/unicode.html
edit: in response comment options. google can supply several possible translations
eg 本
Comments
Post a Comment