Xena normalises HTTP web sites using the HTML plugin. This plugin is part of the basic Xena application.
Before you can normalise an HTTP web site, you will need to set the base directories. See Setting the Base Directories under Configuring Xena.
Enter the URL, including http://, of the web site that you wish to normalise.
The HTTP web site has now been normalised.
The HTTP Configure window allows you to select various options when normalising an HTTP web site.
The Configuration menu has two options, Load and Save. The Load menu item allows you to load a previously saved configuration, while the Save menu item allows you to save the current configuration for use later.
The URL box is where you enter the URL of the HTTP web site that you wish to normalise. Note that you must include the http:// element of the URL. Tick the Resume Previous Run? box if you want Xena to continue normalising a web site that it previously tried, and failed, to normalise.
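As an illustration of the http:// requirement, the following Python sketch checks whether a string would qualify as a URL for the URL box. It is not part of Xena; the function name has_http_scheme is hypothetical, and the sketch assumes that only the http scheme is acceptable, as this section describes.

```python
from urllib.parse import urlsplit


def has_http_scheme(url: str) -> bool:
    """Return True if url has an explicit http:// scheme and names a host.

    Hypothetical helper for illustration only; Xena's own validation
    may differ.
    """
    parts = urlsplit(url.strip())
    # urlsplit lower-cases the scheme, so HTTP:// and http:// are treated alike.
    return parts.scheme == "http" and bool(parts.netloc)


if __name__ == "__main__":
    for candidate in ("http://www.example.com/", "www.example.com"):
        print(candidate, has_http_scheme(candidate))
```

A URL typed without the scheme, such as www.example.com, fails this check because the parser sees no http:// prefix and therefore no host.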
The Force Normaliser check box allows you to force Xena to use a particular normaliser when it normalises the HTTP web site. You then select the normaliser you wish to use from the drop-down list. This option is usually used when you wish to Binary normalise a web site.
Under Follow links are options that control whether Xena normalises elements of the web site other than the resource indicated in the URL box. Tick the Get Images and Resources box if you want Xena to normalise the images on a web site along with the text. Tick the Follow Links? box to have Xena follow the links found at the URL; otherwise Xena captures only the item that matches the URL, i.e. the front page of most web sites. Typing protocols, e.g. FTP, into the Protocols box allows Xena to follow links that use those protocols when normalising the web site. Ticking the Can Leave Host box allows Xena to leave the host domain given in the URL box; otherwise it remains inside that domain. You can limit the domains that Xena will follow by entering the desired domains in the Hosts box, separated by semicolons.
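The way these link options interact can be sketched in Python. This is one plausible reading of the options, not Xena's actual implementation. The function names parse_hosts and should_follow are hypothetical, and the sketch makes several assumptions: host names are compared exactly, the Hosts list only restricts links that leave the starting host, an empty Hosts box means no restriction, and http is followed by default.

```python
from urllib.parse import urlsplit


def parse_hosts(hosts_box: str) -> set:
    """Split the text of the Hosts box on semicolons, ignoring blanks."""
    return {h.strip().lower() for h in hosts_box.split(";") if h.strip()}


def should_follow(link, start_url, follow_links, can_leave_host,
                  hosts_box="", protocols=("http",)):
    """Decide whether a link found on a page would be followed.

    Illustrative only: the precedence of the options is an assumption.
    """
    if not follow_links:
        # Follow Links? is unticked: only the item at the URL is captured.
        return False

    link_parts = urlsplit(link)
    if link_parts.scheme not in {p.lower() for p in protocols}:
        # The link uses a protocol that was not listed in the Protocols box.
        return False

    link_host = (link_parts.hostname or "").lower()
    start_host = (urlsplit(start_url).hostname or "").lower()
    if link_host == start_host:
        return True

    if not can_leave_host:
        # Can Leave Host is unticked: stay inside the starting host.
        return False

    allowed = parse_hosts(hosts_box)
    # Assumption: an empty Hosts box places no limit on external hosts.
    return not allowed or link_host in allowed


if __name__ == "__main__":
    start = "http://a.example/"
    print(should_follow("http://b.example/page", start, True, True,
                        "b.example; c.example"))
```

Under these assumptions, a link to b.example is followed only when Follow Links? and Can Leave Host are both ticked and b.example is either listed in the Hosts box or the box is empty.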