--- Forwarded Message from Richard Kunst <[log in to unmask]> --- >From: Richard Kunst <[log in to unmask]> >Reply-To: [log in to unmask] >To: Language Learning and Technology International Information Forum <[log in to unmask]> >Cc: "C. Ratcliff" <[log in to unmask]> >Subject: Re: #4873 Asian lang. online dictionaries >In-Reply-To: <[log in to unmask]> >Date: Sat, 27 Feb 1999 13:35:59 -0500 (Eastern Standard Time) >Priority: NORMAL ------------------ On Tue, 23 Feb 1999 17:08:08 EST LLTI-Editor <[log in to unmask]> wrote: > > Is there anyone who knows of online dictionaries for Chinese, Japanese, > Korean, Indonesian, Thai, or Hindic languages which allows one to perform > so-called "wild card" searches? Christian- The on-line dictionary engine which is part of the WinCALIS/UniEdit system does what you are seeking, although it is not quite as flexible as a fully wild card-based system. You can search for words which contain strings of 1 or more Han characters (kanji); 2 or more Japanese kana characters; or 3 or more alphabetic characters (*any* alphabet). The characters can be in any position in a word. The search can be limited to whole words (bounded by spaces or predefined delimiters, such as zero-width spaces or "joiners") or the beginning of words (to ignore inflection, since there is no automatic lemmatization), or the search can be unlimited. As for dictionaries using this system, at the moment I am only aware of dictionaries of any useful size for Japanese-and-English (the Jim Breen dictionary database of 105,000 entries--mostly proper nouns) and Korean-and-English (25,000+ entries), which we at Duke distribute -- there may be more out there. But the WinCALIS/UniEdit dictionary engine can be applied to *any* Unicode-encoded text database which has been formatted in a specific way (newline = record separator, '/' = field separator). Thus if you have a database of lexical information for, say, Indonesian, or Bengali, if you format it as above, then run the automatic indexer, you could then use the dictionary engine to search it the way you desire. I have perhaps jumped to a conclusion about your use of "online." The above dictionary engine is for use on-line in Windows, but not on the Web, at least at the moment (although we could certainly adapt it so that a Java client like Java WebCALIS or Java UniEdit could make use of it--just haven't gotten around to that yet. One good way to get the dictionary engine, along with documentation and the Japanese and Korean dictionaries, is to download the Uniedit or WinCALIS Author Tryout Editions from our Website, http://www.lang.duke.edu. Contact me directly for more information. I would welcome an incentive to make the newest 32-bit Dictionary Module available for download, since it can exchange Unicode text data on the clipboard nicely with other Unicode apps such as MS Office, MSIE, and Netscape. Thus you can highlight text in a Web browser and look it up using the WinCALIS/UniEdit dictionary engine. Best, Rick Kunst _/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/ OIT Humanities Computing Facility Tel. (919) 660-3194 319 North Building - Box 90269 Fax: (919) 660-3191 Duke University E-mail [log in to unmask] Durham, NC 27708 USA http://www.lang.duke.edu _/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/_/