Save web sites inside of Alfresco and get them indexed. For knowledge workers it's oftenly important to store related web sites within a projects document collection. Also these websites should be indexed as other documents.
After some researches the Microsoft MHT format is the appropriate one. It transforms a web page into a RFC822 message format including all images and other binary elements. Export of MHT files is built in the Microsoft Internet Explorer 7. There is also a plugin available for Firefox.
One has opened a Web site in the Browser and wants to save this page in Alfresco.
The original URL should be a Link so that it's clickable. The Extractor should extract the text content without HTML/XHTML tags for indexing and preview.
Andreas Hartmann, 02.04.2009 10:30: