ComicGrabber is a small and simple utility to automate the process of fetching images that are embedded in web pages. About ComicGrabber ------------------- ComicGrabber was written to fetch the famous Dilbert comics from their website to archive them. While the program may be useful to fetch other images too, no provisions for this were made and there are also no plans at this point to add support for other websites explicitly. Maybe some will work with the current approach - I haven't tried. If you develop a nifty addition to support other websites, let me know and I will probably add it. What it does ------------ There were two reasons for me to write ComicGrabber: 1) United Media must have decided to mangle the file names of the GIF or JPEG files with the comic strips. (Which is rather useless and silly if you ask me.) They are adding a random sequence of digits in the date part of the file name. So you have to edit the file name of every comic strip you want to save. The funny thing is that they are always adding it at the same point (after the "2" of 2005 on Monday, after the "20" of 2005 on Tuesday etc.) which makes it even more useless. 2) Laziness. :-) It's quite boring to click through the web pages and save the comics, no matter if you have to tweak the file name or not. How to use it ------------- The following text fields are prefilled with the data for the Dilbert comics. If you change them, they are stored via the Java Preferences API when the program exits. (In Windows, they are thus stored in the registry.) "Main URL" This is the URL of the main page where the search starts. So you can see that ComicGrabber will only work for websites with an index page that has links to all pages containing an image that should be downloaded. This should be the case for most websites, though. "Start of link URL" ComicGrabber must somehow be able to recognize the hyperlinks to the subpages that contain the images. This is done with this prefix, i.e. all links starting with this string will be followed. "Start of image URL" When it has loaded a page with an image on it, ComicGrabber must somehow know which images to save. They are recognized by this prefix. "Destination folder" Folder on your local storage (harddisk, network, whatever) where the images should be saved. To save the images, ComicGrabber uses the name of the page on which it was found, assuming that it has a usable name for it. While this is true for the Dilbert archive (and seems to be for all United Media comics), it is clear that a more complicated approach would most likely be needed for other websites. "Fetch" To start fetching the images, press this button. In the log window in the lower part of the program window, you will see what ComicGrabber is doing and if there are any problems. Please note that it is not unusual to see the message "File already exists or could not be created" as you will often already have some images that are still on the website from a previous fetch. As you can see below, it is one of my tasks to make a distinction between the two cases as the latter one indicates a rather serious error (such as missing permissions to create the file in the chosen destination folder or no space on the medium). "About" This opens a dialog window with version information etc. about the program. In this window, you also find two buttons to display this ReadMe and the copyright information. Well, you've probably already found the first button unless you are reading this on my website. ;-) To Do ----- 1) Add a file chooser to be able to choose the directory where the images will be saved. 2) In the log, make a distinction between "file already exists" and "file could not be created". Contact ------- If you want to contact me, feel free to mail me at comicgrabber@nur-weiter-so.de! Copyright (C) 2005 Markus Bäurle --------------------------------- This program is free software; you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation; either version 2 of the License, or (at your option) any later version. This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details. You should have received a copy of the GNU General Public License along with this program; if not, write to the Free Software Foundation, Inc., 59 Temple Place, Suite 330, Boston, MA 02111-1307 USA