波音游戏源码-波音博彩公司评级_百家乐园天将_新全讯网3344111.c(中国)·官方网站

Words on the web: a new direction for semantics

 

A Website Management Programme for students in the Bachelor of Arts with Honours Degree in Language Information Science, offered by CityU's Department of Chinese,Translation and Linguistics, is proving a great success, with student participation now including a large number of students from all three years. The programme is run by Dr Jonathan Webster, Acting Head of CityU's Department of English and Communication. It is one of several projects in which Dr Webster is involved in affiliation with the Institute of Chinese Linguistics (ICL). "The programme has grown considerably from when there were maybe five or six students interested to the present, when almost the entire class is involved. It reflects a great change in student awareness of the importance of the web and IT, in terms of their future careers." 

The programme, which has been running for several years, provides an opportunity for students to gain hands-on experience in managing a web server. The initial focus was putting together a website that was directly related to topics in Chinese language and linguistics. Over the years this has expanded, as students have become involved in other projects, such as putting together a "grammar surgery" for the English Language Centre. It's a computer-assisted language-learning tool for students, with a fun slant of going to the "surgery" to visit the "grammar doctor".

Machine translation project

One of the more challenging projects currently being undertaken in affiliation with the ICL is the application of semantic web technology for an example-based machine translation project (EBMT). Other colleagues from the Department of Chinese, Translation and Linguistics participating in the project along with Dr Webster are Dr K K Sin , Dr H Pan and Mr Caesar Lun .

The EBMT project applies the "Example-based" approach to the translation of the specialized language of legislation and legal documents. The corpus being used with the project consists of bilingual law dictionaries and glossaries, the bilingual texts of Hong Kong legislation, conveyance documents and other legal materials such as judgments, court documents and contracts. The purpose of the project is to meet the growing demand for bilingual legal texts as Hong Kong's legal system converts from a monolingual to a bilingual legal system, while at the same time exploring the full potential of the example-based approach.

The initial task is to design a best-match algorithm for translated text spans ranging in size and scope from words to phrases, clauses and sentence patterns. The algorithm will be rigorously tested and human input of improved translations will be constantly incorporated into the corpus in order to build up and develop the learning ability of the algorithm. In turn, this will enhance the accuracy, consistency and intelligibility of the translated text.

Going through the phases

The EBMT project, which is funded by the University Grants Committee, has three phases: example acquisition, example application and example-base management.

The first phase, example acquisition, is nearly completed. This has involved the text alignment of the 25 million word Bilingual Laws Information System (BLIS) Corpora. The text alignment occurs at various linguistic levels, including word, phrase, clause and sentence. "The BLIS corpus was selected because it was an amazingly good rich text to work with and it was translated by experts. Progress has been quite good in this very difficult phase at the beginning, which is to get the examples by doing the alignment between English and Chinese," said Dr Webster.

Phase two, the example application phase, is currently in progress and deals with how existing examples are used to facilitate translation. The main issues include identification of useful examples in an input sentence, determination of a sequence of identified examples to be used in composing the translation, and further manipulation of the target language parts to render the composition. This is actually the translation process.

Within a year, the team hopes to have a prototype for example application where the examples are in a database and used to improve translation.

The third phase concerns the management of the example-base where the examples are stored in such a way as to facilitate subsequent retrieval. This method draws on advances in semantic web technology.

The semantic web approach provides the means for rendering information in a machine-processable form. "Basically, the web is now moving in a direction of how you can represent the meaning of the text, instead of just having a repository of documents. Instead it will have a rich knowledge base from which to draw information," Dr Webster explained.

Bilingual dictionary database

Another project Dr Webster is working on uses semantic web technology with a bilingual dictionary database. This will be useful for natural language processing as well as being a practical tool for any users of the web.

"Many databases today are very fixed - you input something and retrieve information following a fixed format. With this technology you will be able to store information and retrieve it using rules and inference. The only way you can do that is if your database is rich in terms of knowledge."

YOU MAY BE INTERESTED

Contact Information

Communications and Institutional Research Office

Back to top
百家乐官网注册下注平台| 大发888秘籍| 24山72向水口吉凶断| 皇冠体育| 百家乐翻天粤语下载| 百家乐官网2号干扰| 百家乐五湖四海娱乐| 百家乐官网单机版游戏下载| 水果机榨汁机| 678百家乐官网博彩娱乐平台| 大发888明星婚讯| 利澳百家乐官网的玩法技巧和规则| 谈大发888风水和运气| 百家乐官网过滤工具| 最新皇冠网址| 玩百家乐游戏的最高技巧| 百家乐官网轮盘技巧| 皇冠网百家乐啊| 赢家百家乐官网的玩法技巧和规则| 波克棋牌下载| 百家乐官网1元投注| 正网皇冠开户| 国际娱百家乐的玩法技巧和规则| 百家乐官网棋牌游戏源码| 六合彩投注网| 网络百家乐的玩法技巧和规则| 大佬百家乐官网的玩法技巧和规则 | 银河娱乐场| 赌片百家乐的玩法技巧和规则| 华盛顿百家乐官网的玩法技巧和规则 | 百家乐官网真人游戏| 百家乐官网龙虎斗扎金花| 百乐坊娱乐城官网| 中骏百家乐的玩法技巧和规则| 24山向吉凶详解| 百家乐官网出千的方法| 皇冠网| 88娱乐城网址tlyd| 全讯网社区| 百家乐刷钱| 国际娱百家乐的玩法技巧和规则 |