Chapter 15: Information Search and Visualization
15.2 Database Query and Phrase Search in Textual Documents
-
Searching in structured relational database systems well established task
using SQL language
-
Users write queries that specify matches on attribute levels
-
Example of SQL command
-
SELECT DOCUMENT#
-
FROM JOURNAL-DB
-
WHERE (Date >= and Date<= 1998)
-
and (Language = english or french)
-
and (publisher = Asis or Hfes or ACM).
-
SQL has powerful features, but it requires 2 to 20 hours training
-
While SQL is a standard, many fill-in variants
-
Finding a way not to overwhelm novice users is a challenge
-
Evidence shows that users perform better and have higher satisfaction when
they can view and control the search
Improved designs and consistency across multiple platforms can:
-
bring faster performance
-
reduce mistaken assumptions
-
increase success in finding items
-
Example: AltaVista, Lycos, Infoseek
-
'direct manipulation' could produce:
-
search on the exact string 'direct manipulation'
-
probabilistic search for 'direct' and 'manipulation'
-
probabilistic search for 'direct' and 'manipulation' with some weighting
if the terms are in close proximity
-
boolean search on 'direct' and 'manipulation'
-
boolean search on 'direct' or 'manipulation'
-
error message indicating missing and/or operator or other delimiters
Framework to coordinate design practice:
-
Formulation
-
source of the information
-
fields for limiting the source
-
phrases
-
variants
-
Action
-
explicit or implicit
-
most systems have a search button for explicit initiation, or for delayed
or regularly scheduled initiation
-
Results
-
read messages
-
view textual lists
-
manipulate visualizations
-
Refinement
-
should provide meaningful messages to explain search outcomes
-
should support progressive refinement
-
The four-phase framework can be applied by designers to make the search
process more visible, comprehensible and controllable by users.


