Making an archive for WebGlimpse
For an overview of WebGlimpse, please see the
WebGlimpse home page.
What is an archive?
An archive in WebGlimpse is basically a set of files
that determines the searching parameters for a particular set
of pages. These files are located in the 'root' of the
archive (nothing more than a directory). Also, this directory
must be accessible via the WWW (for the .glimpse-eye.jpg)
and must be accessible as 'nobody' (the user that most
http daemons run as).
The files in the archive are listed below.
After WebGlimpse has been isntalled,
you can set up an archive for WebGlimpse.
Files that make up an archive
WebGlimpse is a search engine based on the
Glimpse search engine. Glimpse uses an index file
and several other configuration files
which are created with glimpseindex.
WebGlimpse creates a few files as well:
- .glimpse_* --
files of the format .glimpse_* are generated and used by
glimpse and glimpseindex.
Please see the Glimpse
home page for more information on Glimpse.
- wgreindex --
a file that will call glimpseindex to recreate the
Glimpse index (and associated files), and will also call
ghgenhtml.
- archive.cfg --
a file that is generated by the
ghgenhtml utility. This file contains the
configuration for the archive. The format of the file is:
<archive title> <archive base URL> <subdirectory indices>
where each field is specified by a \t (tab character). The last field
is a 1 (true) or a 0 (false), and denotes whether or not each
subdirectory has a ghindex.html. If not, just the root directory
has a search page.
- .wg* --
miscellaneous WebGlimpse files. This includes files such as a map from
URLs to filenames (.wgmapfile), the list of files to index (.wg_toindex),
the list of files for which neighborhoods have been generated (.wg_madenh),
and the template for the search box (.wgbox.html).
How to make an archive -- the confarc command
The confarc command is
used to make and reconfigure an archive.
It is run by simply typing confarc (no arguments).
You need to answer the prompts, and the archive will be configured
properly.
If you do not index the archive during
the running of confarc, you will need to run the
wgreindex script at a later time (all confarc
does is run this script).
The wgreindex script is the 'main' script for
indexing the archive. If you want to set up a cron job
for reindexing, this script should be run.
confarc will set up several .wg* files in the archive root,
and the call to wgreindex will create several
.glimpse_* files. confarc will also create
the wgreindex script itself and archive.cfg (where
the configuration is stored).
Reconfiguring an archive
To reconfigure the archive, re-run
confarc, and follow the prompts.
confarc will read in your current settings, and prompt you
for changes.
Return to the WebGlimpse home page

Maintained by Michael Smith
glimpse@cs.arizona.edu