A Conceptual-Modeling Approach to Extracting Data from the Web.

David W. Embley, Douglas M. Campbell, Y. S. Jiang, Stephen W. Liddle, Yiu-Kai Ng, Dallan Quass, Randy D. Smith: A Conceptual-Modeling Approach to Extracting Data from the Web. ER 1998: 78-91
  author    = {David W. Embley and
               Douglas M. Campbell and
               Y. S. Jiang and
               Stephen W. Liddle and
               Yiu-Kai Ng and
               Dallan Quass and
               Randy D. Smith},
  editor    = {Tok Wang Ling and
               Sudha Ram and
               Mong-Li Lee},
  title     = {A Conceptual-Modeling Approach to Extracting Data from the Web},
  booktitle = {Conceptual Modeling - ER '98, 17th International Conference on
               Conceptual Modeling, Singapore, November 16-19, 1998, Proceedings},
  publisher = {Springer},
  series    = {Lecture Notes in Computer Science},
  volume    = {1507},
  year      = {1998},
  isbn      = {3-540-65189-6},
  pages     = {78-91},
  ee        = {db/conf/er/EmbleyCJLNQS98.html},
  crossref  = {DBLP:conf/er/98},
  bibsource = {DBLP}



Brad Adelberg: NoDoSE - A Tool for Semi-Automatically Extracting Semi-Structured Data from Text Documents. SIGMOD Conference 1998: 283-294
Peter M. G. Apers: Identifying Internet-related Database Research. East/West Database Workshop 1994: 183-193
Gustavo O. Arocena, Alberto O. Mendelzon: WebOQL: Restructuring Documents, Databases, and Webs. ICDE 1998: 24-33
Naveen Ashish, Craig A. Knoblock: Wrapper Generation for Semi-structured Internet Sources. SIGMOD Record 26(4): 8-15 (1997)
Paolo Atzeni, Giansalvatore Mecca: Cut & Paste. PODS 1997: 144-153
James R. Cowie, Wendy G. Lehnert: Information Extraction. Commun. ACM 39(1): 80-91 (1996)
Robert B. Doorenbos, Oren Etzioni, Daniel S. Weld: A Scalable Comparison-Shopping Agent for the World-Wide Web. Agents 1997: 39-48
David W. Embley, Douglas M. Campbell, Randy D. Smith, Stephen W. Liddle: Ontology-Based Extraction and Structuring of Information from Data-Rich Unstructured Documents. CIKM 1998: 52-59
Ashish Gupta, Venky Harinarayan, Anand Rajaraman: Virtual Database Technology. SIGMOD Record 26(4): 57-61 (1997)
Nicholas Kushmerick, Daniel S. Weld, Robert B. Doorenbos: Wrapper Induction for Information Extraction. IJCAI (1) 1997: 729-737
Stephen W. Liddle, David W. Embley, Scott N. Woodfield: Unifying Modelling and Programming through an Active, Object-Oriented, Model-Equivalent Programming Language. OOER 1995: 55-64
Stephen Soderland: Learning to Extract Text-Based Information from the World Wide Web. KDD 1997: 251-254

Referenced by

  1. M. Tamer Özsu: Review - Record-Boundary Discovery in Web Documents. ACM SIGMOD Digital Review 2: (2000)
  2. David W. Embley, Y. S. Jiang, Yiu-Kai Ng: Record-Boundary Discovery in Web Documents. SIGMOD Conference 1999: 467-478
  3. Wolfgang May, Rainer Himmeröder, Georg Lausen, Bertram Ludäscher: A Unified Framework for Wrapping, Mediating and Restructuring Information from the Web. ER (Workshops) 1999: 307-320
  4. Terje Brasethvik, Jon Atle Gulla: Semantically Accessing Documents Using Conceptual Model Descriptions. ER (Workshops) 1999: 321-333
Lecture Notes in Computer Science: Copyright © by Springer
ACM SIGMOD Anthology: Copyright © by ACM
DBLP: Copyright © by Michael Ley, last change: Sat May 16 23:10:16 2009