NAME

extractHosts


SYNOPSIS

Read the hosts.xml file into a perl hash. Select hosts from this database, by specified capabilities, speeds, firewall, and accessibility from a specified ISP/firewall/config. Write out a file in human-readable/statProxy input format for later checking.

perl extractHosts.pl resultFile [firewall=ffff] [from=ffff] [<tag>[=regex] ...]


OPTIONS

resultFile
        Send the selected result set to this file. 
A '-' pipes to STDOUT (probably your screen)
firewall
        Only proxies inside this firewall are selected
from
        Only proxies which are accessible from inside 
this firewall are selected
tag
        Only proxies with capability tag matching the 
accompanying regular expression are selected.
If the regex is omitted, the corresponding tag
value is defaulted to 1. There may be more than
one tag expression.

All option terms are 'AND'ed together


EXAMPLES

Find proxies on port 8000, which are enabled:
perl extractHosts.pl extract.out isEnabled addressPort=.*:8000
Find proxies on port 3128, enabled, which are in the KSA Ogertel network:
perl extractHosts.pl port3128KSA.out addressPort=.*:3128 isEnabled firewall=KSA-zajil
Find proxies accessible from ISP UAE-dialup:
perl extractHosts.pl uae.out isEnabled from=UAE-dialup
Find non-censoring proxies accessible from any KSA ISP:
perl extractHosts.pl ksa-isuProxies.out isEnabled from=KSA-ISU nonCensoring
Find all proxies except those on specified ports; list on screen:
perl extractHosts.pl - -addressPort="^.*:(80|3128|8080|8000|8888|1080|8081|6588)$"
Find high speed proxies (reference page time less than 2 secs); list on screen:
perl extractHosts.pl - referencePageTime="^[0-1]\.\d+$"

In a Unixish environment, you will need to use single quote delimiters on regexes and you may need to escape different shell metacharacters