TEL-8 Query Interfaces
This dataset contains the original query interfaces, together with their manually extracted query capabilities, of 447 deep Web sources from 8 representative domains.
The domains fall into 3 groups whose initials, "TEL", give the dataset its name: Travel (Airfares, Hotels, and Car Rentals), Entertainment (Books, Movies, and Music Records),
and Living (Jobs and Automobiles).
For each source, this dataset archives its root homepage and its query-interface
pages. In addition, it includes the manually extracted query capability for each interface.
The document describes the creation and
usage of this dataset.
Note that the zipped file uncompresses properly on Linux. On Windows, however, uncompressing it may cause errors because some file names are in a format that Windows does not support.
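For readers who need to unpack the archive on Windows, the following is a minimal Python sketch of one possible workaround, not a method provided with the dataset. It assumes that the archive is a standard ZIP file and that the errors come from characters that Windows forbids in file names (such as `?` or `:`); neither assumption is confirmed by this page. The function names and the archive name `tel8.zip` in the usage note are hypothetical.

```python
import re
import zipfile
from pathlib import Path

# Characters Windows does not allow in file names.
# Assumption: these are what make some archived file names fail to extract.
_FORBIDDEN = re.compile(r'[<>:"|?*\x00-\x1f]')


def safe_name(member: str) -> str:
    """Map an archive member path to a relative path that is legal on Windows."""
    parts = []
    for part in member.replace("\\", "/").split("/"):
        if part in ("", ".", ".."):
            continue  # drop empty and relative components
        # Replace forbidden characters; Windows also rejects trailing dots/spaces.
        part = _FORBIDDEN.sub("_", part).rstrip(" .")
        parts.append(part or "_")
    return "/".join(parts)


def extract_sanitized(archive: str, dest: str) -> list:
    """Extract every file in `archive` under `dest`, renaming unsafe members.

    Returns the sanitized relative paths that were written, in archive order.
    """
    written = []
    with zipfile.ZipFile(archive) as zf:
        for info in zf.infolist():
            if info.is_dir():
                continue  # directories are created on demand below
            name = safe_name(info.filename)
            if not name:
                continue
            target = Path(dest, name)
            target.parent.mkdir(parents=True, exist_ok=True)
            target.write_bytes(zf.read(info))
            written.append(name)
    return written
```

A call such as `extract_sanitized("tel8.zip", "tel8")` would write the files under a `tel8` directory. Two members whose names differ only in forbidden characters would map to the same path and the later one would overwrite the earlier, so comparing the returned list against the archive listing is a sensible check.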
Tasks Using This Dataset
Kevin Chen-Chuan Chang, Bin He, Chengkai Li, and Zhen Zhang
Computer Science Department
University of Illinois at Urbana-Champaign
Date Created: May 2003