You cannot select more than 25 topics
Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
35 lines
1.6 KiB
Plaintext
35 lines
1.6 KiB
Plaintext
3 years ago
|
ht://Dig contributed scripts
|
||
|
|
||
|
This directory tree contains perl and shell programs that attempt to
|
||
|
do things with the generated databases. Most of these were written
|
||
|
for a very specific purpose for the specific version of ht://Dig that
|
||
|
was current at that point. This means that some of these programs
|
||
|
will be severely broken! Do not expect them to work; use them only as
|
||
|
examples of the types of things you can do with the ht://Dig
|
||
|
databases.
|
||
|
|
||
|
More contributed work is available on the ht://Dig website:
|
||
|
<http://www.htdig.org/contrib/>
|
||
|
|
||
|
What's here:
|
||
|
|
||
|
acroconv.pl An external converter script that uses acroread to parse PDFs
|
||
|
autorun An example of automating the database building
|
||
|
changehost A script to change hostnames of URLs in the databases
|
||
|
conv_doc.pl A sample script to use the conversion features of external_parsers
|
||
|
doclist List the information in the doc db (or after a certain date)
|
||
|
ewswrap Two sample htsearch wrappers to emulate Excite for Web
|
||
|
Servers (EWS) and to simplify queries
|
||
|
handler.pl A sample external_protocols script to handle HTTP/HTTPS using curl
|
||
|
htparsedoc A sample shell script to parse Word documents
|
||
|
multidig A set of scripts to simplify updating multiple databases
|
||
|
parse_doc.pl A general external parser script that handles MS Word documents
|
||
|
(among others)
|
||
|
run-robot.sh Another example of automating the database building
|
||
|
scriptname An example of using htsearch within dynamic SSI pages
|
||
|
status.pl Build a status page of last 5 runs and top 10
|
||
|
servers (by # URLs)
|
||
|
urlindex Build an index of all the URLs in the database
|
||
|
whatsnew Build a "what's new" page with custom header and footer
|
||
|
wordfreq Build a list of words and frequency in the database
|