#title(情報知識学会 2001 年度 第 9 回研究報告会 (2001年5月19日(土))) *情報知識学会 2001 年度 第 9 回研究報告会 (2001年5月19日(土)) [#te94d0b1] **頻度情報を用いた漢字辞書の評価法 &br;−知識ベースの漢字入力に向けて− [#r3169c9a] ''堀 幸雄、池村 匡哉''~ ''神奈川大学''~ This paper reports an evaluation method for Japanese Kanji input system based on frequency information and user's input history information. Japanese Kanji Frequency information follows Zipf's law, user's input history information is follow LRU algorithm's hit rate, reports the result conducted evaluation experiment of this. And considere about a future Japanese Kanji input system. **漢字の異形字表記に対応した検索システム [#j2ec37ad] ''阪口 哲男、赤穂 義範''~ ''図書館情報大学''~ The Japanese character set consists of Hiragana, Katakana, and Kanji. Some Kanji characters have the variants that have the same meaning and pronunsiations. The users of full-text retrieval systems sometimes confuse some Kanji characters with the variants and get insufficient result for their needs. The authors considered that a function that unifies Kanji variants is need for full-text retrieval systems. This paper describes a full-text retrieval system that has functions of unifying Kanji variants. The system has a thesaurus of Japanese Kanji variants and uses it for indexing and unifying Kanji characters. The authors built two example retrieval systems based on the system. They have the datebase of ULIS-DL (http://lib.ulis.ac.jp/) metadate and Japan-MARC. **日米対応特許データに基づく対訳自動抽出 [#uc4ade1e] ''樋口 重人†、福井 雅敏†、藤井 敦††、石川 徹也††''~ ''†パトリス、††図書館情報大学''~ To facilitate retrieving patent infomation across languages, we are developing a multi-lingual patent retrieval system, where user queries are translated into the target language by way of a dictionary. In this paer, aiming to enhance the translation dictionary, we propose a method to automatically extract translations from Japan-US patent families consisting of Japanese/English comparable texts. Ours method computes the association score for each combination of Japanese / English words in patent families, and selects those with greater scores as translations. We also show the effectiveness of our method through experiments. **XML のプレゼンテーションと検索 [#cc70c4b3] ''重元 康晶†、藤澤 由美、宮崎 智、菅原 秀明††''~ ''†富士通、††遺伝研''~ It has been three years since the specification of XML was announced. In the meantime, recommendations have been also made for technologies for applying XML to applications. Here we summarize the merit of XML technologies and introduce the application to an actual date system, i.e., datebases of WFCC-MIRCEN World Date Centre for Microorganisums (WDCM). **情報共有による Z39.50 データベース選択支援環境 [#e6edad82] ''江草 由佳,高久 雅生,宇陀 則彦,石塚 英弘''~ ''図書館情報大学''~ This paper describes the environment that supports selection of Z39.50 databases for users. The environment allows users to share information of Z39.50 databases in WWW. The present authors develop the WWW-Z39.50 client. In this environment, the users can share information which other users organize by themselves. When a user clicks one of list of Z39.50 databases, which is shared information, the window for retrieval of the database will be opened for the user to start his/her retrieval immediately. **利用者からみた Z39.50 を考える [#ecf734d3] ''鳥越 直寿''~ ''インフォコム''~ Z39.50 is the international standard of an information-retrieval protocol, and can offer the history-reference search function, the environment independent from a specific vender, the crossing reference function with unified interface, etc. In the result of the questionnaire and interview to the university library which is the user of Z39.50 and is an information addresser, it is observed as choice of the technical element in the information disclosure, in spite of the very low domestic popularization level. Although many problems remain in the present condition, it is important not to judge Z39.50 from the viewpoint of only the advantage in the present condition, but to judge on the basis of the future possibility. **デジタル・アーカイブの現状と [#z0d2ac2b] ''原田 隆史''~ ''慶應大学''~ The definition of the "digital archives" is ambiguous; various types of organizations, such as web archives, digital museum, data archives and so on, which collect original material in digital form has been called digital archives. This study gives some concrete examples of digital archives and discusses their good effect and problems. Although the good effect given by digitalizing materials is acknowledged widely, the evaluation of digital archives is not easy; for instance, the cost effectiveness is difficult to estimate. One of the roles of the digital archives is to preserve cultural assets; most of them have been founded and operated by government or public organizations. The future of the digital archives might depend on whether the value of them would be given social consensus of their cost effectiveness. **同業者を集めた電子モールシステムの構築 [#mbfd6df7] ''平野 貴弘、野上 暁功、森川 弘信、田中 猛彦、中山 優''~ ''和歌山大学''~ In recent years, electronic commercial transactions have been getting popular. We attend to develop an electronic shopping mall available to the Internet. This shopping mall is constructed by a server computer for receiving the orders together with existing \\databases of the stores which are connected with the server computer. For giving fresh information of the goods to the purchasers, it is indispensable that the server computer and each trader's database frequently exchange messages. We also supply the methods which support the communication. Gathering traders of same goods will lead to a commercial competition. As a result, purchasers will buy fine goods at a lower cost, while the sales dealer will make it easier to secure the new customers and repeaters. **商業出版におけるコンテンツ配信の課題 [#q9b23958] ''深見 拓史''~ ''済堂''~ In this paper I described the Problems of the Contents Distribution System in the Commercial Publishing Business in Japan. Last Time I reported the Influence of the Internet on the Commercial Publishing Business. This time I discussed the changes of the environment of the contents distribution system in the commercial publishing. What was changed in the viewing or browsing software and in the changing mechanism of Internet contents? And also I described some new trials in the world. **情報知識学思案 3 [#m01046f9] ''村上 茂三''~ ''止観第一研''~ We have been making some attempts for a new paradigm of Information and Knowledge Sience. Researching our specific subject \\ "Database for getting new ideas or thoughts", we are always feeling some intellestual gap between the Western thinking manners and ours. Naturally, present Information and Knowledge Science is established on Western Civilizations. On the other hand, our thinking manner is based on Japan Culture. Recently they say that our Japan Culture is unique among many World Civilizations. So we hope new frameworks of which foundation is constructed the complexes of Japan Culture and others. Toward this difficult but pleasant subject, we began to make a small first steep. At present we are interesting in the following ideas as the viewpoints. + Our hope is that the New Paradigms must be based on not only Western Civilizations but also Eastern and Japan Culture. + We have many types of unique logic, arts and skills in our culture. Through the Ancient and Middle times, some of them were born in Japan islands and others came from Korea, China and India. + Now we are trying to discover new merits among them from the viewpoint of information system. **特定構文を用いた用語間の意味関係の抽出 [#qb8d1ab1] ''石川 大介、藤原 譲''~ ''神奈川大学''~ SS-SANS method is to extract semantic relationships with automatic. This work is to apply SS-SANS extract in a associative relationships in the test collection which includes thesis data(300 thousands of japanese abstracts). The process uses templates between two terms for extraction and it extracted automaticaly 150 thousands of relationships. These relationships find out distinction between front term and end term. Front terms are called OBJ terms and end terms are called SBJ terms. Analyse of terms for OBJ terms or SBJ terms and their distributions. In conclusion, distinctions to use terms for OBJ terms or SBJ terms are explained. **意味関係抽出による概念の構造化 [#h9ff1838] ''近藤 雄裕、藤原 譲''~ ''神奈川大学''~ Semantic processing which uses a great deal of knowledge which was structuralized require to be equipped with learning and consideration functions in computers. The conceptual structure according to conceptual relation exists as a construction of structuralized information that is necessary to semantic processing. Systematizing terminology as minimum unit which expresses a concept is most important to semantic processing. It made conceptual structure which combined the C-TRAN method extracting equivalent relationships and the SS-KWEIC method which extracts hierarchy relationships and associative relationships, moreover extracted data by the SS-SANS method extracting associative relationships. Hence the new relationships are found among terminologies and judgment material for semantics of terminology increases. Finally it became possible to get more detailed terminologies to systematize. **構造化された知識を基にした情報検索システム [#w6d0409a] ''森本 貴之、近藤 雄裕、杉田 勝彦、石川 大介、池村 匡哉、藤原 譲''~ ''神奈川大学''~ The global flow of information is being developed at unprecedented speed, and an importance of information retrieval become higher. However, users may not make a good use of huge amount of information by using conventional computers whose major functions are numerical calculation, symbol matching in information retrieval and deduction. Especially, an importance of information retrieval becomes higher. However, it is difficult to search relevant information that is satisfied its purpose from a vast information efficiently. To solve these problems, a new intelligent method of information retrieval using organized knowledge resources based on semantic relationships is proposed. **XML 文章における意味情報の自動推定 [#afe1224d] ''中挟 知延子''~ ''東洋大学''~ This work is a report on the use of a language application using semantic representation in an XML format. The aim of the application is to correct the misuse of Japanese postpositional words, called joshi, as they have been used in sentences written by non-Japanese students. First, the sentences written by the students are procesed morphologically so that pairs of joshi nonus and pairs of joshi verbs are extracted. Next, document data that include the same extracted nonus are picked up from an EDR corpus where there nouns are stored with other pairs of joshi that are used as training example in the correction process. The data are automatically transformed into semantic representation in an XML form. "Association rules" are applied in the correction process to assist in learning how to use the appropriate joshi after target nouns. Semantic representation is used to provide suitable explanations of how joshi should be used in a sentence. **研究者ディレクトリデータベースからのキーワード抽出による分野間の関連分析 [#gc5b7359] ''西澤 正己、孫 媛、矢野 正情''~ ''情報学研究所''~ In this study, relationships between the Information Science and other fields were investigated in terms of keywords extracted from research themes of each researcher. First, procedure for extracting keywords from the datebase of Directory of Researchers was presented. As an index measuring the relationship, we introduced a coefficient of normalized complete congruence and further applied the Correspondence Analysis method to analyze the relations between the three items of the so-called Information Science and other related research fields at Japanese universities. **ネットニュースにおける対立の分析手法の提案 [#f65bbb6c] ''瀬尾 雄三†、矢野 正情††''~ ''†東大先端経工セ、††情報学研究所''~ In the Net News, conflicts among the participants are often observed. It seems that the conflicts are often caused by the difference in the cultural background of the participants. In this paper, we intend to propose a quantitative extraction method of the conflicts. Articles in the Net News can be positioned in tree structures (threads) defined by the referential relaation. A thread is treated as a set of samples for the analysis. Each article is distinguished by key word frequencies and they are summarized to principal component scores. Applying this technique to actual Net News threads, we found that when a thread includes conflicts, some scores vibrate with wavelength of two generations in the thread. Also the key of the conflict can be estimated with the meaning of the vibrating principal component. We expect that such a quantitative extraction technique of conflicts is useful for study of the sosial relation through electronic communication. **「知識創発」を支援するドキュメント・マネジメント [#ta895f89] ''西村 健''~ ''ドキュメント・エンジニアリング研究所''~ ホワイトカラーの業務活動において過半を占める「思考的、準思考的作業」は、知識の組換え、並べ替えなど文書内容のダイナミックな加工編集処理が中心であるが、これまでの文書管理では、固定的な紙文書だけを扱っていたため、電子文書といってもワープロ、PDF、グループウェアなどに限られていた。「XMLドキュメント・マネジメント・システム」は、こうした動的な業務プロセスの中心である文書内容のダイナミックな編集処理加工性能を担保することにより、「知識創発」を促し、大幅な知的生産性向上を支援できる。