TEL-8 Query Interfaces


Abstract

This dataset contains the original query interfaces, together with their manually extracted query capabilities, of 447 deep Web sources from 8 representative domains. The domains fall into 3 groups whose initials, "TEL", give the dataset its name: Travel (Airfares, Hotels, and Car Rentals), Entertainment (Books, Movies, and Music Records), and Living (Jobs and Automobiles). For each source, the dataset archives its root homepage and query-interface pages, and it includes the manually extracted query capability for each interface.

Documentation

The document describes the creation and usage of this dataset.


Data files

Note that the zipped file uncompresses properly on Linux. On Windows, uncompressing it may cause errors because some file names use a format that Windows does not support.

Tasks Using This Dataset


Sources

Original Owners

Kevin Chen-Chuan Chang, Bin He, Chengkai Li, and Zhen Zhang
Computer Science Department
University of Illinois at Urbana-Champaign
binhe[at]uiuc.edu

Date Created: May 2003

