The OCR applied here serves only to index PDF files; the page layout is lost. Nevertheless, the following three steps convert a multipage PDF file to a single text file:
$ convert -density 150 foo.pdf ./tesseract/tmp/p%02d.tif
$ montage.exe ./tesseract/tmp/*.tif -tile 1x -mode concatenate ./tesseract/tmp/foo.tif
$ tesseract.exe ./tesseract/tmp/foo.tif output -l eng
For simplicity, the TIF files p00.tif to pXY.tif are concatenated into a single TIF file that has the width of one page and the height of XY pages. That way at least the order of the text, that is the text flow, is preserved. One could also arrange all the TIF files as a mosaic instead. A density of 150 (dpi) gives reasonable results with tesseract.
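The three steps can also be scripted. The following is a minimal Python sketch, assuming the ImageMagick tools `convert` and `montage` and the `tesseract` binary are on the PATH under exactly those names; the function names and the `pdf_to_text` wrapper are made up for illustration. It uses only the options shown in the commands above.

```python
import glob
import subprocess
from pathlib import Path


def convert_cmd(pdf, tmp_dir, density=150):
    """Step 1: rasterize each PDF page to tmp_dir/p00.tif, p01.tif, ..."""
    return ["convert", "-density", str(density), str(pdf),
            str(Path(tmp_dir) / "p%02d.tif")]


def montage_cmd(page_files, combined_tif):
    """Step 2: stack the page images in one column (-tile 1x)."""
    return ["montage", *map(str, page_files), "-tile", "1x",
            "-mode", "concatenate", str(combined_tif)]


def tesseract_cmd(combined_tif, output_base, lang="eng"):
    """Step 3: OCR the tall image; tesseract writes output_base + '.txt'."""
    return ["tesseract", str(combined_tif), str(output_base), "-l", lang]


def pdf_to_text(pdf, tmp_dir="./tesseract/tmp", output_base="output"):
    """Run the three steps in order and stop at the first failing command."""
    tmp = Path(tmp_dir)
    tmp.mkdir(parents=True, exist_ok=True)
    subprocess.run(convert_cmd(pdf, tmp), check=True)
    # Only the page files (p*.tif) are collected, sorted by name so the
    # two-digit page numbers keep the pages in reading order.
    pages = sorted(glob.glob(str(tmp / "p*.tif")))
    combined = tmp / "foo.tif"
    subprocess.run(montage_cmd(pages, combined), check=True)
    subprocess.run(tesseract_cmd(combined, output_base), check=True)
    return output_base + ".txt"
```

Calling `pdf_to_text("foo.pdf")` would then leave the recognized text in `output.txt`. Building the commands as argument lists keeps the `%02d` pattern away from any shell quoting.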
Today I took a look at the CD-ROM version of Wikipedia, also known as the "offline" version. The CD is no longer being reissued, since the Wikipedia DVD comes out on 17 March 2005. Still, the free encyclopedia is worth a look, and it then runs directly from the hard disk. The reader for Windows (Digibib4), Mac OS X (MacDigibib) and Linux (Digibux) is required for offline use.
Under Windows the ISO can also be mounted directly with Daemontools, so there is no need to burn it to a CD-ROM first.
In parallel, there are projects to run Wikipedia directly from CD-ROM using a Knoppix-like system (e.g. via Morphix or Lamppix) [3,4].
Tell me if I am wrong, but wasn't one of the key features of HTML at the time of its introduction that pages are linked by <a href=''>link</a>, that is, linking from inside a page A.html to a page B.html? But if one renames page B.html to C.html and neither rewires the links by hand nor uses rewrite tricks, one gets what NZZ online is currently playing with. Is it intentional, or are they just not aware of it? Please give us the web back. ;)
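To make the "rewire" idea concrete, here is a minimal Python sketch, assuming the site keeps a table of renamed pages. The `renames` table and the `resolve` function are hypothetical and say nothing about how NZZ online or any particular web server handles this. A server could answer a request for the old name with a permanent redirect to the resolved name.

```python
def resolve(path, renames):
    """Follow rename records until a path with no further rename is reached."""
    seen = set()
    while path in renames:
        if path in seen:
            # A.html -> B.html -> A.html would otherwise loop forever.
            raise ValueError(f"rename loop at {path}")
        seen.add(path)
        path = renames[path]
    return path


# Hypothetical rename table: old file name -> new file name.
renames = {"B.html": "C.html"}
print(resolve("B.html", renames))  # prints C.html
```

Pages that were never renamed resolve to themselves, so old links in A.html keep working without being edited by hand.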
Unfortunately  is not a working substitute for  as one would expect:
Motivating all faculties of a university to change their websites [e.g. 1] to a corporate style is quite a bit of work. The University of Bern has chosen a two-step program towards a CMS-based solution. First, all the pages are redesigned in the new style; most of that part is already done. Second, they intend to move those pages to the CMS, which is said to be Apache Lenya. My first thought was that this is some kind of Sisyphean task, but while redesigning the wiso page I realized that I rather like the two-step concept ;)
Yet another way to share Outlook calendar events is to use BlogWave in conjunction with a calendar adapter:
BlogWave is an "RSS Generator": a tool which can pull information from a variety of sources and publish it as RSS. This process is very easy to configure and can be scheduled to run automatically. For example, using BlogWave you can create an RSS feed from Sharepoint announcements on your company's internal site. Or you can publish event logs as RSS.
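The general idea of turning calendar events into a feed can be sketched in a few lines of Python. This does not use BlogWave or its adapters; the event dictionaries, the `events_to_rss` function and the example.com address are assumptions made for illustration. The output follows the basic RSS 2.0 layout of a channel with title, link and description, followed by one item per event.

```python
import xml.etree.ElementTree as ET
from datetime import datetime


def events_to_rss(title, link, description, events):
    """Build an RSS 2.0 document with one <item> per calendar event."""
    rss = ET.Element("rss", version="2.0")
    channel = ET.SubElement(rss, "channel")
    ET.SubElement(channel, "title").text = title
    ET.SubElement(channel, "link").text = link
    ET.SubElement(channel, "description").text = description
    for event in events:
        item = ET.SubElement(channel, "item")
        ET.SubElement(item, "title").text = event["subject"]
        # Start time and location go into the item description as text.
        when = event["start"].strftime("%Y-%m-%d %H:%M")
        ET.SubElement(item, "description").text = f"{when}, {event['location']}"
    return ET.tostring(rss, encoding="unicode")


# Hypothetical events, as they might be exported from a calendar.
events = [
    {"subject": "Team meeting", "start": datetime(2005, 3, 17, 10, 0),
     "location": "Room 1"},
]
feed = events_to_rss("My calendar", "http://example.com/calendar",
                     "Upcoming events", events)
print(feed)
```

A scheduled job could write `feed` to a file on a web server, which is roughly the role the quoted description assigns to the generator.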
chregu of Bitflux GmbH is looking for a developer with strong skills in XSLT and PHP. Bitflux has strong expertise in realizing custom client projects, mostly using their own Bitflux CMS (BxCMS). Another interesting product is their Bitflux Editor, a WYSIWYG XML editor. Most if not all of their tools are Open Source.
By default, Google Desktop Search collects a limited amount of non-personal information from your computer and sends it to Google. This includes summary information, such as the number of searches you do and the time it takes for you to see your results, and application reports we'll use to make the program better. You can opt out of sending this information during the installation process or from the application preferences at any time.
Personally identifying information, such as your name or address, will not be sent to Google without your explicit permission.
The last paragraph seems to be the tricky one. Reading that sentence, it looks as if Google has access to, or is able to access and collect, address information on your computer. The phrase "without your explicit permission" suggests that the end user is only a click away from letting Google actually use that information. We will probably read more about this in the blogosphere sooner or later. ;)
I am dropping this link here as a reminder for myself: Book on Demand. If this is a more straightforward way for authors to make books, why not. As far as I know there is another such service in Germany, called bod.de.
Edit This Page PHP is a PHP script that can be uploaded to any web host that supports PHP. It allows the HTML content of a page to be edited via a link on that page. Only two files are required: the core PHP script (editthispage.php) and a data file for each page. The core file can support as many pages as desired. More background information on this lightweight wiki is available from Christopher Allen, one of its authors.
Swisscom has a cool product called Easypay. You buy a prepaid card in denominations of CHF 25.00, 50.00 and 75.00 at local stores, kiosks, post offices etc. If you want to buy a product from an online store or use an online service, you simply enter the Easypay number in the window on the provider's website and press "buy". The Easypay service is also available in combination with mobile phones.
This is very practical for providers of microcontent such as blogging services with MMS or mp3 music shops. I have no idea how simple it is to implement that service on the provider's side, but it is worth a try.
The Zulu web generator implementation consists of an Excel table and some Visual Basic code. The layout and structure of the website are adjusted by editing the Excel table. Zulu's main idea is the separation of layout, structure and content of a website. By processing these three, the Zulu web generator creates static websites that can be uploaded to the web server.
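The separation of layout, structure and content can be illustrated with a small Python sketch. This is not Zulu's Excel and Visual Basic implementation; the `LAYOUT` template, the `STRUCTURE` list, the `CONTENT` table and the `generate` function are all hypothetical stand-ins that only show the principle.

```python
from string import Template

# Layout: one HTML skeleton shared by all pages.
LAYOUT = Template(
    "<html><head><title>$title</title></head>"
    "<body><ul>$nav</ul><div>$body</div></body></html>"
)

# Structure: which pages exist and in which order they appear in the menu.
STRUCTURE = [
    {"file": "index.html", "title": "Home"},
    {"file": "about.html", "title": "About"},
]

# Content: the body text of each page, keyed by file name.
CONTENT = {
    "index.html": "<p>Welcome.</p>",
    "about.html": "<p>About this site.</p>",
}


def generate(layout, structure, content):
    """Combine the three parts into a dict of file name -> static HTML."""
    nav = "".join(
        f'<li><a href="{page["file"]}">{page["title"]}</a></li>'
        for page in structure
    )
    return {
        page["file"]: layout.substitute(
            title=page["title"], nav=nav, body=content[page["file"]]
        )
        for page in structure
    }


site = generate(LAYOUT, STRUCTURE, CONTENT)
print(sorted(site))  # prints ['about.html', 'index.html']
```

Writing each entry of `site` to a file gives a static website; changing the look means editing only `LAYOUT`, and adding a page means one new row in `STRUCTURE` and one in `CONTENT`.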