
HTTrack Website Copier release history:
--------------------------------------

This file lists all changes and fixes that have been made for HTTrack.

3.47-13
+ Fixed: random closing of files/sockets, leading to "zip_zipWriteInFileInZip_failed" assertion, "bogus state" messages, or random garbage in downloaded files
+ Fixed: libssl.dylib is now in the search list for libssl on OSX (Nils Breunese)
+ Fixed: bogus charset because the meta http-equiv tag is placed too far in the html page
+ Fixed: incorrect \\machine\dir structure build on Windows (TomZ)
+ Fixed: do not force a file to have an extension unless it has a known type (such as html), or a possibly known type (if delayed checks are disabled)
+ Fixed: HTML 5 addition regarding "poster" attribute for the "video" tag (Jason Ronallo)
+ Fixed: memory leaks in proxytrack.c (Eric Searcy)
+ Fixed: correctly set the Z flag in hts-cache/new.txt file (Peter)
+ Fixed: parallel patch, typo regarding ICONV_LIBS (Sebastian Pipping)
+ Fixed: memory leak in hashtable, that may lead to excessive memory consumption
+ Fixed: on Windows, fixed possible DLL local injection (CVE-2010-5252)
+ Fixed: UTF-8 conversion bug on Linux that may lead to buggy filenames
+ Fixed: zero-length files not being properly handled (not saved on disk, not updated) (lugusto)
+ Fixed: serious bug that may lead to the same file being downloaded several times, and "Unexpected 412/416 error" errors
+ Fixed: images in CSS were sometimes not correctly detected (Martin)
+ Fixed: links within javascript events were sometimes not correctly detected (wquatan)
+ Fixed: webhttrack caused bus error on certain systems, such as Mac OSX, due to the stack size (Patrick Gundlach)
+ Fixed: bogus charset for requests when filenames have non-ascii characters
+ Fixed: bogus charset on disk when filenames have non-ascii characters
+ New: support for IDNA / RFC 3492 (punycode) handling
3.46-1
* New: Unicode filenames handling
* Fixed: bug in handling of update/continue with erased files or renamed files, leading to "Unexpected 412/416 error (Requested Range Not Satisfiable)" and/or "Previous cache file not found (-1)" errors
* Fixed: escape characters >= 128 when sending GET/HEAD requests to avoid server errors
* Fixed: do not use "delayed" extensions when the mirror is aborting
* Fixed: generate error pages when needed (Brent Palmer)
* Fixed: parsing issue with js files due to "script" tags (Vasiliy)
* Fixed: anonymous FTP without password (Luiz)
* Fixed: Makefile issues regarding parallel build and examples (Sebastian Pipping)
* Fixed: removed deprecated and annoying "Accept-Charset" header in requests (Piotr Engelking) (closes:#674053)
3.45-4
* New: source license is now GPLv3
* New: added a "K5" feature to handle transparent proxies (Brent Palmer)
* New: option -y to control ^Z behavior (Julian H. Stacey)
* Fixed: replace // by / when saving rather than _/ (Brent Palmer)
* Fixed: do not interpret ^C before mirror is finished, or after
* Fixed: webhttrack: do not use md5sum to produce a temporary filename, but mktemp (Ryan Schmidt)
* Fixed: document %k for custom structure (full query string)
3.45-3
* Fixed: spurious "Previous file not found (erased by user ?)" messages leading to retransfer of existing files in cache (Alain Desilets)
* Fixed: --max-time now stops the mirror smoothly (Alain Desilets)
3.45-2
* Fixed: number of simultaneous connections was often only one
* Fixed: "Unexpected 412/416 error" leading to broken files on disk
3.45-1
* Fixed: interrupting an update/continue mirror session should no longer delete previously downloaded content (William Roeder, Alain Desilets and many others)
* Fixed: --continue/--update bug leading to already cached data being downloaded again in some cases (especially redirect/error pages)
3.44-5
* Fixed: crash when using -d with non-fully-qualified hostname (Alain Desilets)
* Fixed: typo in logs (Pascal Boulerie)
3.44-4
* Fixed: random crash when interrupting the mirror (spotted by -fstack-protector) in htscoremain.c (closes:#657878)
3.44-3
+ Fixed: Linux build (closes:#657334)
3.44-2
+ Fixed: malformed format htslib.c (Moritz Muehlenhoff)
+ Fixed: default footer print format
+ New: clever "^C" handling
+ New: added --do-not-generate-errors option
+ New: increased maximum cookie name
3.44-1
+ Fixed: Randomly corrupted files during updates due to "engine: warning: entry cleaned up, but no trace on heap"/"Unexpected 412/416 error" errors (Petr Gajdusek; closes:#614966)
3.43-12
+ Fixed: buffer overflow while repairing httrack cache if a damaged cache is found from a previous mirror (closes:#607704)
3.43-11
+ Fixed: webhttrack fixes for icecat (closes:#605140)
3.43-10
+ Fixed: capture URL not working properly when IPv6 is installed (John Bostelman)
3.43-9
+ Fixed: application/xhtml+xml not seen as "html" (Peter Fritzsche)
+ Fixed: various linux fixes for desktop files (closes:#563691)
3.43-8
+ Fixed: URL encoding bugs with filenames containing '%' characters (sandboxie32)
+ Fixed: MacPorts Darwin/Mac fixes to webhttrack (Ross Williams)
+ Fixed: Flash link extraction has been improved (Vincent/suei8423)
3.43-7
+ Fixed: "Open error when decompressing" errors due to temporary file generation problems (Scott Mueller)
3.43-6
+ Shell: WIN32 setup cosmetic fixes: do not probe the proxy on non-local network, do not force *.whtt registration
3.43-5
+ Fixed: code tag handling bug in certain cases leading to invalid links being produced (Tom-tmh13 and William Roeder)
3.43-4
+ Fixed: horrible SSL slowdowns due to bogus select() calls (Patrick Pfeifer)
+ Fixed: Konqueror fixes
3.43-3
+ Updated: Portugues-Brasil language file
3.43-2
+ Fixed: wizard question buggy, and commandline version did not print it (Maz)
+ Fixed: do not rename xml subtypes (such as xsd schemas) (Eric Avrillon)
3.43
+ Fixed: Fixed too aggressive javascript url= parsing (Chris)
+ Fixed: fixed --urllist option "sticking" the list content to the list of URLs (Andreas Maier)
+ Fixed: "Previous cache file not found" not redownloading file when deleted before an update (William Roeder)
+ Fixed: *.rpm.src files renamed to *.src.src with bogus servers (Hippy Dave)
+ Fixed: "pause" is pausing much faster (William Roeder)
+ Fixed: binary real media files and related files are no longer being parsed as html (William Roeder)
+ Fixed: "File not parsed, looks like binary" check no longer corrupts the checked binary file
+ Fixed: multiple download of error pages (several identical '"Not Found" (404) at link [identical link]') leading to a slowdown in certain cases (William Roeder)
+ Fixed: sometimes, a double request was issued to update a broken file
+ Fixed: display bug "link is probably looping, type unknown, aborting .."
+ Fixed: missing library references at build time and other build related issues (Debarshi Ray)
+ Fixed: on windows, switched from wsock32.dll to ws2_32.dll
+ Fixed: minor argument size validation error for "-O" option (Joan CALVET)
3.42-3
+ Fixed: Bad URL length validation in the commandline (CVE-2008-3429) (Joan CALVET)
3.42-2
+ Fixed: Random crashes at the end of the mirror due to a dangling file pointer (Carlos, angus at quovadis.com.ar)
3.42
+ Fixed: size limits are stopping the mirror gently, finishing pending transfers (David Stevenson)
3.41-3
+ Fixed: text/plain advertised files renamed into .txt
+ Fixed: broken configure.in
3.41-2
+ Fixed: major WIN32 inlined function bug caused the cache not to be used at all, causing update not to work
3.41
+ New: changed API/ABI to thread-safe ones (libhttrack1 2), big cleanup in all .h definitions
+ Fixed: Major memory usage bug when downloading large sites
+ Fixed: do not rename files if the original MIME type was compatible
+ Fixed: several source fixes for freeBSD (especially time problems)
+ New: option %w to disable specific modules (java, flash..)
+ Fixed: 'no space left in stack for back_add' error
+ Fixed: fixed redirected images with "html" type
+ Fixed: 'Crash adding error, unexpected error found.. [4268]' error
3.40-2
+ Fixed: bogus '.del' filenames with ISO-9660 option
+ Fixed: now merges the header charset even with an empty footer string
+ New: --port option for webhttrack
3.40
+ New: mms:// streaming capture (thanks to Nicolas Benoit!)
+ New: proxyTrack project released
+ New: new experimental parser that no longer needs link testing ('testing link type..')
+ New: Redirect handled transparently with delayed type check and broken links made external when the "no error page" option is enabled
+ New: improved background download to handle large sites
+ New: '--assume foo/bar.cgi=text/html' is now possible
+ New: MIME type scan rules (such as -mime:video/* +mime:video/mpeg)
+ New: size scan rules now allow rewriting uncaught links as external links
+ Fixed: crash fixed when ftime()/localtime()==NULL
+ Fixed: iso-9660 option now using '_' for collision character
+ Fixed: collision problems with CaSe SeNsItIvItY
+ Fixed: a href='..' fixed!
+ Fixed: redirects are now handled by the new experimental parser
+ Fixed: "./" links generated with fixed outbound links (../../)
+ Fixed: 'base href' bogus in many cases
+ Fixed: enforce security limits to avoid bandwidth abuses
+ Fixed: bogus external (swf) parser, fixed remaining .delayed files
+ New: new check-mime and save-file2 callbacks
+ New: "always delayed type check" enabled
+ Fixed: totally bogus finalizer causing compressed files not to be uncompressed, and many files to be truncated
+ Shell: new Finnish interface added!
+ Fixed: "..html" bogus type
+ Fixed: remaining bogus .delayed entries
+ Fixed: flush before user-defined command
+ Fixed: fixed user-defined command call and background cleaner
+ Fixed: fixed 'Crash adding error, unexpected error found.. [4250]' error
+ Fixed: fixed cache absolute file reference (the reference is now relative) preventing the cache from being moved to another place
+ Fixed: webhttrack 'Browse Sites' path bug
+ Fixed: old httrack cache format fixes (import of older versions did not work anymore)
+ Fixed: port fixes in htsnet.h
+ Fixed: -N option with advanced extraction (bogus "not found" member)
+ Fixed: javascript: location=URL was not recognized
+ Fixed: no more character escaping when not needed (such as UTF-8 codes)
+ Fixed: possibly temporary files left on disk with bogus servers giving compressed content on HEAD requests
+ Fixed: URL hack caused unexpected filename collisions (index.html vs INDEX.HTML)
+ Fixed: "do not erase already downloaded file" option now correctly works (it leaves files linked in the mirror)
+ Fixed: UCS2 encoded pages are now converted properly into UTF-8
+ New: "near" option now also catches embedded (images, css, ..) files
+ Fixed: bogus chunked multimedia link text files (such as x-ms-asf files)
+ Fixed: compilation problems on Un*x version
3.33
+ Fixed: Bogus redirects with same location in https
+ Fixed: Bogus file naming with URL hack
+ Fixed: Extremely slow redirections and empty files
+ Fixed: Bogus names with directories ending with a "."
+ New: Number of connections per second can now be.. decimal, to delay even more
+ New: Enforce stronger ISO9660 compliance
+ Shell: "URL Hack" in interface
+ Shell: "Save settings" now rebuilds categories
+ Shell: "Shutdown PC after mirror" option
+ Shell: Sound at the beginning/end of the mirror (configurable through system sound properties)
+ Shell: Fixed drag & drop, .url import
+ Shell: Fixed "wizard" mode (crash)
+ Fixed: Crash at the end due to unterminated pending threads
+ Fixed: \ is not anymore transformed into / after the query (?) delimiter
+ New: Two new callbacks for pre/post-processing html data
+ New: link-detected2 callback (additional tag name parameter)
+ Fixed: Broken ISO9660
+ Fixed: Crash on file:// links
+ Fixed: Unescaped ampersands (&) in URLs
+ Fixed: Transfer hangs introduced in 3.33-beta-2
+ Fixed: Display bug "Waiting for scheduled time.."
+ Fixed: Bug "Waiting for scheduled time.." (NOT a display bug, actually)
+ Fixed: CaSe SenSiTiViTy bugs with multiple links referring to the same URL but using different case styles
+ Fixed: Failed to build from sources (FTBFS) on amd64 archs because of cast problems (Andreas Jochens)
+ Fixed: & were converted into   (Leto Kauler)
+ Shell: Fixed crash with long URL lists (Libor Striz)
+ Fixed: connection/seconds limiter replugged
+ Fixed: "no files updated" display bug
+ Fixed: bogus links encoded with UTF (Lukasz Wozniak)
+ New: --assume can be used to force a specific script type (Brian Schröder)
3.32
+ Fixed: css and js files were not parsed!
+ Fixed: again broken file:// (infinite loops with local crawls)
+ Fixed: Bandwidth limiter more gentle with low transfer rate
+ Fixed: external wrappers were not called during updates/continue
+ New: additional callback examples
+ Fixed: overflow in unzip.c fixed
+ New: tests are now cached for better performances!
+ New: %r (protocol) option for user-defined structure
+ Fixed: Broken engine on 64-bit archs
3.31
+ New: Experimental categories implemented
+ New: New cache format (ZIP file)
+ New: .m3u files now crawled
+ New: .aam files now crawled
+ Fixed: Broken ftp
+ Fixed: Broken file://
+ Fixed: Broken cookies management and loading
+ Fixed: HTTrackInterface.c:251 crash
+ Fixed: "N connections" means "N connections" even in scan phase
+ Fixed: javascript:location bug
+ Fixed: libtool versioning problem fixed
+ Fixed: More javascript bugs with \' and \"
+ Fixed: .HTM files not renamed into .html anymore
+ Fixed: OSX fixes in the Makefile script
+ New: Default "referer" and "from" fields
+ New: Full HTTP headers are now stored in cache
+ Fixed: ftp transfer not logged/properly finalized
+ Fixed: Missing symbolic link in webhttrack install
+ Fixed: path and language not saved in webhttrack
+ Shell: Avoid invalid project names
+ Fixed: Javascript bug with src=
+ Fixed: Keep-alive consistency problems on Linux/Unix with bogus servers (SIGPIPE)
+ Fixed: Parsing bug inside javascript (bogus parsing with empty quotes in function: foo(''))
+ Fixed: static compiling on Linux/Unix
+ Fixed: bloated .h headers (internal function definitions)
+ Fixed: Bogus query strings with embedded ../ and/or ./
+ New: Added "change-options" call in the crawl beginning
+ New: Query arguments now sorted for normalized URL checks (when "url hack" option is activated)
+ Fixed: Previous dependency on zlib.dll changed to zlib1.dll
+ Fixed: Broken static files were not correctly updated with the new cache format
+ Shell: Launch button in Internet Explorer
+ Fixed: Crash when dealing with multiple '?' in query string with 3.31-alpha
3.30
+ New: Webhttrack, a linux/unix/bsd Web GUI for httrack
+ New: "URL hack" feature
+ New: HTTP-headers charset is now propagated in the html file
+ New: loadable external engine callbacks
+ New: Experimental ".mht" archives format
+ Fixed: Query ?? bug
+ Fixed: Bogus base href without http://
+ Fixed: Several javascript bugs
+ Fixed: UCS2 pages badly detected
+ Fixed: Build structure change does not redownload files
+ Fixed: "?foo" URL bug (link with only a query string) fixed
+ Fixed: ' or " inside non-quoted URLs
+ Fixed: keep-alive problems with bogus servers
+ Fixed: Broken .ra files
+ Fixed: More javascript bugs
+ Fixed: ftp transfers not properly monitored in the shell
+ Fixed: various fixes in webhttrack
+ Fixed: Blank final page in webhttrack
+ Fixed: Javascript comments (//) are skipped
+ Fixed: Temporary fix for "archive" bug with multiple java archives
+ Fixed: Inlined js or css files have their path relative to the parent
+ Fixed: Unescaped quotes ("") when continuing/updating in commandline mode
+ Fixed: Null-character in html page bug
+ Fixed: External depth slightly less bogus
+ Fixed: Filters based on size bogus ("less than 1KiB" is now functioning)
+ Fixed: Strange behaviour with filters (last filter "crushed")
+ Fixed: Bogus downloads when using ftp (unable to save file)
+ Fixed: Freeze with keep-alive on certain sites due to bad chunk encoding handling
+ Fixed: Problems with javascript included paths
+ Fixed: The mirror now aborts when the filesystem is full
+ Fixed: "No external pages" option fixed
+ Fixed: Javascript and \" in document.write bug fixed
+ Fixed: Two memory leaks in temporary file generation, and in link build fixed
+ Fixed: Bogus compression with non-gzip format
+ Fixed: Larger range of charsets accepted
+ Fixed: Bogus robots.txt when using comments (#)
+ Fixed: Missing MIME types for files such as .ico
+ Shell: Fixed continuous proxy search
+ Shell: Fixed missing HelpHtml/ link
+ Fixed: Overflow in htsback.c:2779
+ Fixed: Bogus style and script expressions due to too aggressive parsing
+ Fixed: Javascript parsing bugs with \" and \'
+ Fixed: Javascript link detection bugs when comments were inserted between arguments
+ Fixed: Bug when valid empty gzip content was received
+ New: More aggressive "maximum mirroring time" and "maximum amount of bytes transferred" options
+ New: Windows file://server/path syntax handled
+ Fixed: mht archive fixes
+ Fixed: Serious bugs with filters given in commandline erased by the engine
+ Fixed: Bogus parsing of javascript: generated inside document.write() inside javascript code removed
3.23
+ New: Keep-alive
+ New: URLs size limit is now 1024 bytes
+ New: Bogus UCS2 html files hack
+ Fixed: base href bugs
+ Fixed: windows "dos devices" bug fixed
+ Fixed: dirty parsing now avoids ","
+ Fixed: "get non-html files near a link" option sometimes caused huge mirrors
+ Fixed: Bugs if zlib library is not found
+ Fixed: Bug with "near" and "no external pages"
+ Fixed: "Link empty" crash
+ Fixed: Several javascript bugs
+ Fixed: Keep-alive problems ("unknown response structure")
+ Fixed: Major keep-alive bug (connection not closed)
+ Fixed: 8-3 options not working, ISO9660 option improved
+ Fixed: Bogus links with embedded CR, TAB..
+ Fixed: small ../ external link bug fixed
3.22-3
+ Fixed: Slow engine due to (too strict) memory checks
+ Fixed: Overflow in htscore.c:2353
+ Fixed: Bogus chunked files with content-length fixed
+ Fixed: Folders renamed into ".txt" on Un*x platforms bug fixed!
+ New: Scan rule list (-%S) added
+ New: Cache debugging tool (-#C) added
3.21-8
+ New: Basic Macromedia Flash support (links extraction)
+ New: Modular design for https, flash parser and zlib
+ New: Standard autoconf/configure design on Un*x platforms
+ New: Modular design also on Windows platforms (dll/lib)
+ Fixed: Text files without extension not renamed "html" anymore
+ Fixed: Bug with "?foo" urls
+ Fixed: No chmod 755 on home anymore
+ Fixed: Stability problems due to bad file structure checks
+ Fixed: Overflow in GUI/commandline when displaying statistics
+ Fixed: Directory creation error
3.20-2
+ New: HTTPS support (SSL)
+ New: ipv6 support
+ New: 'longdesc' added
+ New: new file 'new.txt' generated for transfer status reports
+ New: ISO9660 compatibility option
+ New: empty mirror/update detection improved
+ New: Update hack now recognizes "imported" files
+ New: Option to disable ipv4/ipv6
+ New: Filters now recognize patterns like -https://*
+ Fixed: The engine should be now fully reentrant
+ Fixed: Fixes for alpha and other 64-bit systems
+ Fixed: Files downloaded twice if not found in cache
+ Fixed: ftp problems with 2xx responses
+ Fixed: ftp problems with multiple lines responses
+ Fixed: ftp %20 not escaped anymore
+ Fixed: ftp RETR with quotes problems
+ Fixed: now tolerant to empty header responses
+ Fixed: hts-log closed
+ Fixed: Compressed pages during updates
+ Fixed: Crash when receiving empty compressed pages
+ Fixed: Random crashes in 'spider' mode
+ Fixed: bcopy/bzero not used anymore..
+ Fixed: various code cleanups
+ Fixed: Better UTF8 detection
+ Fixed: External links now work with https and ftp
+ Fixed: Top index.html corrupted or missing
+ Fixed: URL list crashes
+ Fixed: Random crashes with large sites due to bogus naming handler
+ Fixed: Freezes on some robots.txt files
+ Fixed: Compressed files not stored
+ Fixed: SVG fixes
+ Fixed: Raw HTML responses
+ Fixed: 406 error workaround
+ Fixed: Crashes due to binary files with bogus HTML type (not parsed anymore)
+ Fixed: External https and ftp links broken, relative https links broken
+ Fixed: Automatic resizing of filter stack
+ Fixed: Various ampersand (&) elements added
+ Fixed: https with proxy temporary workaround (direct connection)
+ Fixed: "base href" with absolute uris
+ Fixed: stack frame too large on some systems
+ Fixed: random bad requests due to bogus authentication
+ Shell: Several fixes, including registration type problems
+ Shell: "template files not found" fixed
3.16-2
+ Fixed: Zlib v1.4
+ Fixed: Gzipped files now downloaded without problems (HTTP compression bug)
+ Fixed: Ending spaces in URLs now handled correctly
+ Fixed: META-HTTP bug
+ Shell: Type registration done only once
3.15
+ Fixed: Bogus HTTP-referer with protected sites
+ Fixed: Fatal IO/socket error with large sites (handles not closed)
+ Fixed: K4 option now works
+ Fixed: --continue+URL(s) now clears previous URLs
+ Fixed: Parsing bug with 'www.foo.com?query'
+ Shell: 'Soft cancel' documented
+ Shell: 'Kx' options added
3.10
+ Fixed: Broken pipes on Linux version
+ Fixed: Commandline version bug with gzipped files
+ Fixed: Crash when reaching compressed error pages
+ Fixed: Bogus html-escaped characters in query strings
+ Fixed: Files skipped (bogus anticipating system)
+ Fixed: Crash when showing stats (div by zero)
+ Fixed: Problems with URLs/redirects containing spaces or quotes
+ Fixed: Slash added when ~ detected
+ Fixed: Ugly VT terminal
+ New: Faster and cleaner mirror interrupt
3.09
+ Fixed: Several problems with javascript parsing
+ Fixed: Elements after onXXX not parsed
+ New: Source update wrapper
+ New: Style url() and @import parsed
+ Shell: Word database and maximum number of links
+ Shell: Option changes taken in account immediately
+ Shell: Cleaner installer (registry keys)
3.08
+ New: HTTP compression is now supported
+ New: Faster response analysis
+ Fixed: External page in html if cgi
+ Fixed: Mix between CR and CR/LF for comments
+ Fixed: Top index corrupted
+ Shell: Better refresh during parsing
+ Shell: DLL error
3.07
+ Fixed: Random crashes with HTTP redirects
+ New: New rate limiter (should be sharper)
+ New: Code cleaned up, new htscore.c/httrack.c files
3.06
+ Fixed: Redirect to https/mailto now supported
+ New: Top index/top dir for Un*x version
+ New: Sources more modular (.so)
+ New: Quicktime targetX= tags
+ New: HTTP 100 partially supported
3.05
+ Fixed: Non-scannable tag parameters ("id","name",..)
+ Fixed: Java classes not found when using "." as separator
+ Fixed: Java classes not found when missing .class
3.04
+ Fixed: URLs with starting spaces
+ Fixed: bogus URLs when using "base href"
+ Shell: --assume and -%e options included
+ New: Documentation updated a little
3.03
+ New: Parser optimizations, 10 times faster now!
+ New: New --assume option to speed up cgi tests
+ New: Option to avoid Username/password storage for external pages
+ New: Query string kept for local URIs
+ Fixed: RFC2396 compliant URLs accepted (//foo and http:foo)
+ Fixed: foo@foo.com not considered as URL anymore
+ Fixed: Space encoded into %20 in URIs
+ Fixed: "Unable to save file" bug
+ Fixed: Corrupted top index.html
+ Fixed: Cookies disabled with --get
+ Fixed: Cache bug for error pages
3.02
+ Fixed: Pages without title recorded in top index
+ Fixed: Error with Content-type+Content-disposition
+ Fixed: backblue.gif/external.html files not purged anymore
+ Fixed: Encoding problems with files containing %2F or other characters
+ Fixed: Write error reported for HTML files
+ New: hts-stop.lock file to pause the engine
+ New: New install system using InnoSetup
3.01
+ New: HTTP real media files captured
+ Fixed: Bogus statistics
+ Fixed: Minor fixes
3.00
+ New: New interface, with MANY improvements!
+ New: Better parsing (enhanced javascript parsing, sharper HTML parsing)
+ New: Faster and more efficient background download system
+ New: ETag properly handled
+ New: Optional URL list
+ New: Optional config file
+ New: New structure options
+ New: New filters options (size filters)
+ New: Better password site handling
+ New: Traffic control to avoid server overload
+ New: Setuid and Chroot for Unix release
+ New: limited 64-bit handling
+ New: .js files are now parsed
+ New: Single hts-log.txt file, error level
+ New: New top index.html design
+ New: "Update hack" option to prevent unnecessary updates
+ New: Default language sent for mirrors
+ New: Searchable index
+ Fixed: Bogus ftp routines (Linux version)
+ Fixed: Bug that caused to mirror a complete site from a subdir
+ Fixed: Bug that caused restart to be very slow
+ Fixed: Bug that caused loops on several query-string pages (?foo=/)
+ Fixed: Corrupted cache bug
+ Fixed: Random broken links (pages not downloaded)
+ Fixed: Shared links problems
+ Fixed: Bogus URLs with commas (,)
+ Fixed: Bogus / and \ mixed
+ Fixed: Bogus addresses with multiple @
+ Fixed: Bogus links with %2E and %2F
+ Fixed: Bogus empty links
+ Fixed: "Unexpected backing error" bug fixed
+ Fixed: Files with incorrect size no longer accepted
+ Fixed: Top index.html created even for untitled pages
+ Fixed: Bogus N100 option (unable to save file)
+ Fixed: Deadlock when using many hosts in URLs
+ Fixed: Password stored internally to avoid access errors
+ Fixed: Fixed /nul DOS limit
+ Fixed: Bogus -* filter (nothing mirrored)
+ Fixed: .shtml now renamed into .html
+ Fixed: Content-disposition without ""
+ Fixed: External html page for /foo links
+ Fixed: Username/password % compliant
+ Fixed: Javascript parser sometimes failed with " and ' mixed
+ Fixed: Some Range: bugs when re-getting complete files
+ Fixed: Range: problems with html files
+ Fixed: HTTP/1.1 407 and 416 messages now handled
+ Fixed: Bogus timestamp
+ Fixed: Null chars in HTML bug
+ Fixed: Error pages cache bug
+ Fixed: Connect error/site moved do not delete everything anymore!
+ Fixed: Bogus garbage ../ in relative URL
+ Shell: New transfer rate estimation
+ Shell: Fixed crash when using verbose wizard
+ Shell: dynamic lang.h for easier translation updates
+ Shell: Fixed some options not passed to the engine
+ Fixed: A lot of minor fixes!
2.2
Note: 3.00 alpha major bug fixes are included in the 2.2
2.02
+ New: Cache system improved, compatible with all platforms
+ New: Update process improved (accurate date)
+ New: Remote timestamp for files
+ New: ETag (HTTP/1.1) supported
+ Shell: Portuguese interface available
+ Fixed: Bug with links containing commas
+ Fixed: 'file://' bug with proxy
+ New: Engine a little bit faster
+ Shell: Some bugs fixed in the interface
2.01
+ New: ftp through proxy finally supported!
+ New: Sources cleaned up
+ New: Again some new marvelous options
+ New: Speed improved (links caught during parsing, faster "fast update")
+ New: Tool to catch "submit" URL (forms or complex javascript links)
+ Shell: German interface available
+ Shell: Dutch interface available
+ Shell: Polish interface available
+ Fixed: Level 1 bug fixed
+ Fixed: Still some parsing/structure problems
+ Fixed: Referer now sent to server
+ Fixed: Cookies did not work properly
+ Fixed: Problems with redirect pages
+ New: Better javascript parsing
+ Fixed: Problems with URL-parameters (foo.cgi?param=2&choice=1)
+ Fixed: Problems with ftp
+ New: ftp transfers are now in passive mode (firewall compliant)
2.00 -- The First Free Software Release of HTTrack!
+ New: HTTrack sources (command line), now free software, are given
+ Shell: Interface rewritten!
+ New: Documentation rewritten
+ Shell: Drag&Drop abilities
+ Shell: More URL information
+ Shell: Fixed: Remote access problems
+ Fixed: Loop problems on some sites causing crashes
+ Fixed: URL encoding problems
+ Fixed: Some file access problems for ../
+ Fixed: Some fixes for updating a mirror
+ Shell: Crazy progress bar fixed
+ Fixed: Form actions are rewritten so that cgi on form can work from an offline mirror
+ Fixed: Crashes after continuing a "hand-interrupted" mirror
+ Fixed: Bogus files with some servers (chunk bug)
1.30
+ Shell: Interface improved
+ New: robots.txt are followed by default
+ New: Parsing speed improved on big (>10,000 links) sites with a hash table
+ New: Mirror Link mode (mirror all links in a page)
+ New: Cookies are now understood
+ New: No external pages option (replace external html/gif by default files)
+ New: Command line version improved, background on Unix releases
+ Fixed: Problems with javascript parsing
+ Fixed: Username/password not set to lowercase anymore
+ Fixed: Problems with base href
+ New: Links in level 1 html files now patched
+ New: Expurge now deletes unused folders
+ New: Option -V executes shell command for every new file
+ Shell: Primary filter now works
1.24
+ Fixed: Ftp protocol bogus (with login/pass)
+ Fixed: Cache problems (corrupted files)
+ New: Expurge old files for updates
+ New: "Updated" messages for mirror updates
+ Shell: Autodial/hangup option to RAS
+ Fixed: index.html were not created sometimes
+ Shell: Fixed: Random crashes with the interface
+ Shell: Fixed: Filters profile not saved
+ Fixed: Various (and numerous) fixes
1.23
+ Shell: Interface improved
+ Shell: Multiple setups
+ Shell: Redefine options
+ Shell: Continue interrupted mirror improved
1.22
+ Fixed: Parsing up/down did not work well
+ Fixed: Several files not caught, bug fixes
+ Fixed: Problems with classes (1.21)
+ New: Transfer rate can be limited (-A option)
+ Shell: Smooth refresh
+ New: ftp basic protocol a little bit improved
1.21
+ Fixed: Several java classes were not parsed
+ Fixed: Some folders without ending / ignored
+ Fixed: Crashes due to content-type too long
1.20
+ Shell: documentation!
+ Fixed: Some problems with 'host cancel' system after timeouts (crashes)
+ New: Get only end of files if possible (file partially get)
+ New: New cache system (only HTML stored)
+ New: User-defined structure possible
+ New: Also available: french interface
+ Fixed: Random crashes (div by 0/illegal instruction) with null size files
+ New: Limited ftp protocol (files only), e.g. +ftp://* now works
+ Fixed: Some connect problems with several servers or proxies
+ New: New option, save html error report by default
+ Shell: Browse and see log files at the end of a mirror
+ New: Proxy authentication (ex: guest:star@myproxy.com:8080)
+ Shell: Interface improved (especially during mirror)
+ Fixed: Ambiguous files are renamed (asp,cgi->html/gif..)
+ Shell: New test link mode option
+ New: Site authentication (ex: guest:star@www.myweb.com/index.html)
+ Fixed: Minor bugs fixed
+ Shell: See log files during a mirror
+ Fixed: Some problems using CGI (different names now)
+ Fixed: Go down/up/both options and filters
+ Fixed: "Store html first" did not work
+ New: -F option ("Browser ID") disguises HTTrack as a browser
+ New: New filter system
+ Shell: New "Save as default" options
+ Fixed: "Build options" did NOT work properly! (files overwritten or missing)
+ Fixed: User agent ID fixed
+ Shell: Skip options
+ Shell: Better interface control during mirrors
+ Shell: InstallShield and Help files
+ Fixed: Some external links were not filtered sometimes
+ Fixed: Mirror crash at the end
1.16b
+ Shell: Really *stupid* bug fixed causing WinHTTrack to be slooow
+ Fixed: Crash if the first page has no title fixed
+ Fixed: Bogus options like "Just scan" saved empty files
+ Fixed: Forbid all links (*) with manual accept did not work
+ Shell: Filters interface improved
1.16:
+ New: Java Classes and subclasses are now retrieved!
+ New: Better JavaScripts parsing
+ New: Option: Abandon slowest hosts if timeout/transfer too slow
+ Shell: Interface improved
1.15b
+ Fixed: Some bugs fixed
1.15:
+ Shell: Interface improved
+ New: Robot improved (some files through javascript are now detected!)
+ New: Improved wild cards (for example, +www.*.com/*.zip)
+ New: 'config' file to configure proxy, path.. only once
1.11
+ New: Wait for specific time (begin transfer at specific hour)
+ New: Time limit option (stops transfer after x seconds)
+ Shell: Interface improved for an easy use
1.10e
+ Fixed: Maps were not correctly managed (stupid bug)
1.10d:
+ Fixed: Bogus index.html fixed
1.10c
+ Shell: "Time out" field needed "transfer rate" field
1.10b
+ Fixed: Better memory management
1.10
+ New: "Transfer rate out" option added (abandon slowest sites)
+ New: "Deaf" hosts do not freeze HTTrack any more
+ Fixed: Again problems with code/codebase tags
+ New: Broken links detection improved
1.04
+ Fixed: Some links were not correctly read (pages with "codebase" tags)
+ Shell: Interface improved
1.03 (No changes for the command-line robot)
+ Shell: Big bug fixed! (VERY slow transfer rates..)
1.02
+ Fixed: Some java files were not correctly transferred
+ New: Speed has been improved
+ Fixed: Log file more accurate
+ Shell: Interface has been improved
1.01
+ Fixed: Structure check error in some cases
1.00 -- The 1.00, Yeah!
+ New: base and codebase are now scanned
0.998 beta-2
+ Fixed: Multiple name bug (files having the same name in the same directory) with -O option fixed
0.997 beta-2
+ Fixed: Filenames with '%' were not correctly named
+ Fixed: Bug detected in 0.996: several files are not written on disk!!
0.996 beta-2
+ New: -O option (path for mirror and log)
+ New: Unmodified file time/date are not changed during an update
0.99 beta-2
+ New: User-agent field
+ New: Shortcuts (--spider etc.)
+ New: Links not retrieved are now rebuilt absolutely
+ New: The 'g' option (just get files in current directory) has been added
+ New: Primary links analysis has been improved
+ Fixed: "304" bug fixed
0.25 beta-2
+ Fixed: Freeze during several mirrors fixed!
+ New: More 'N' options (filenames type)
0.24 beta-2
+ Fixed: Restart/Update with cache did not work (really not..)
+ Fixed: Wild cards now work properly (e.g. +www.abc.com* does work)
+ New: The 'n' option (get non-html files near a link) has been added!
0.23 beta-2
+ Fixed: The 'M' option (site size) did not work
+ Fixed: Files larger than 65Kb were not correctly written
older beta
+ Many, many bugs fixed
