The original paper is in English. Non-English content has been machine-translated and may contain typographical errors or mistranslations. ex. Some numerals are expressed as "XNUMX".
Copyrights notice
The original paper is in English. Non-English content has been machine-translated and may contain typographical errors or mistranslations. Copyrights notice
구조화된 문서에는 의미 체계가 자연스럽게 데이터베이스 값으로 저장되거나 데이터베이스 값과 직접적으로 대응되는 문자열이 포함되는 경우가 많습니다. 문서의 문자열과 해당 데이터베이스 값 사이에 양방향 논리적 링크를 구축함으로써 의미가 풍부한 쿼리를 표현할 수 있습니다. 데이터베이스 값과의 링크가 있는 텍스트를 모델링하기 위해 "paratext"라는 새로운 ADT를 도입했습니다. 파라텍스트는 논리적으로 두 개의 평행 레이어로 구성된 것으로 간주됩니다. "외관" 레이어에는 일반 텍스트(예: 문자열의 선형 시퀀스)가 배치되는 반면 "참조" 레이어에는 OID 및 리터럴 배열이 포함됩니다. 참조 레이어의 각 OID 또는 리터럴은 모양 레이어 텍스트의 연속 하위 문자열과 연관되어 있으며 연관된 하위 문자열의 의미를 나타냅니다. 우리는 또한 이 문서 모델에 대한 도메인별 기능을 설계했습니다. 함수를 사용하면 두 레이어 사이를 오가는 쿼리를 표현할 수 있습니다. 구조화된 문서에서 이러한 문자열은 논리적 요소의 전체 내용에 나타날 수도 있고 논리적 요소 내의 문구로 나타날 수도 있습니다. 또한 파라텍스트 ADT 구현을 위한 프레임워크를 제시하고 전통적인 전체 텍스트 인덱싱 기술을 확장하여 파라텍스트를 지원하는 방법에 대해 논의합니다.
The copyright of the original papers published on this site belongs to IEICE. Unauthorized use of the original or translated papers is prohibited. See IEICE Provisions on Copyright for details.
부
Masatoshi YOSHIKAWA, Hiroyuki KATO, Hiroko KINUTANI, "Design Framework of a Database for Structured Documents with Object Links" in IEICE TRANSACTIONS on Information,
vol. E82-D, no. 1, pp. 147-155, January 1999, doi: .
Abstract: Structured documents often contain character strings of which semantics can be naturally stored as database values or has direct correspondence with database values. By building bilateral logical links between character strings in documents and corresponding database values, semantically rich queries are made expressible. We have introduced a new ADT, named "paratext," to model text which has links with database values. Paratexts are logically viewed as consisting of two parallel layers; on the "appearance" layer, ordinary text (i. e. a linear sequence of character strings) is placed, while the "reference" layer holds an array of OIDs and literals. Each OID or literal on the reference layer is associated with a contiguous substring of the appearance layer text, and represents the semantics of the associated substring. We have also designed domain-specific functions for this document model. Using the functions, we can express queries which go back and forth between the two layers. In structured documents, such character strings can appear in the whole content of logical elements, or as phrases inside logical elements. We also present frameworks for the implementation of the paratext ADT, and discuss how traditional full-text indexing techniques can be extended to support paratext.
URL: https://global.ieice.org/en_transactions/information/10.1587/e82-d_1_147/_p
부
@ARTICLE{e82-d_1_147,
author={Masatoshi YOSHIKAWA, Hiroyuki KATO, Hiroko KINUTANI, },
journal={IEICE TRANSACTIONS on Information},
title={Design Framework of a Database for Structured Documents with Object Links},
year={1999},
volume={E82-D},
number={1},
pages={147-155},
abstract={Structured documents often contain character strings of which semantics can be naturally stored as database values or has direct correspondence with database values. By building bilateral logical links between character strings in documents and corresponding database values, semantically rich queries are made expressible. We have introduced a new ADT, named "paratext," to model text which has links with database values. Paratexts are logically viewed as consisting of two parallel layers; on the "appearance" layer, ordinary text (i. e. a linear sequence of character strings) is placed, while the "reference" layer holds an array of OIDs and literals. Each OID or literal on the reference layer is associated with a contiguous substring of the appearance layer text, and represents the semantics of the associated substring. We have also designed domain-specific functions for this document model. Using the functions, we can express queries which go back and forth between the two layers. In structured documents, such character strings can appear in the whole content of logical elements, or as phrases inside logical elements. We also present frameworks for the implementation of the paratext ADT, and discuss how traditional full-text indexing techniques can be extended to support paratext.},
keywords={},
doi={},
ISSN={},
month={January},}
부
TY - JOUR
TI - Design Framework of a Database for Structured Documents with Object Links
T2 - IEICE TRANSACTIONS on Information
SP - 147
EP - 155
AU - Masatoshi YOSHIKAWA
AU - Hiroyuki KATO
AU - Hiroko KINUTANI
PY - 1999
DO -
JO - IEICE TRANSACTIONS on Information
SN -
VL - E82-D
IS - 1
JA - IEICE TRANSACTIONS on Information
Y1 - January 1999
AB - Structured documents often contain character strings of which semantics can be naturally stored as database values or has direct correspondence with database values. By building bilateral logical links between character strings in documents and corresponding database values, semantically rich queries are made expressible. We have introduced a new ADT, named "paratext," to model text which has links with database values. Paratexts are logically viewed as consisting of two parallel layers; on the "appearance" layer, ordinary text (i. e. a linear sequence of character strings) is placed, while the "reference" layer holds an array of OIDs and literals. Each OID or literal on the reference layer is associated with a contiguous substring of the appearance layer text, and represents the semantics of the associated substring. We have also designed domain-specific functions for this document model. Using the functions, we can express queries which go back and forth between the two layers. In structured documents, such character strings can appear in the whole content of logical elements, or as phrases inside logical elements. We also present frameworks for the implementation of the paratext ADT, and discuss how traditional full-text indexing techniques can be extended to support paratext.
ER -