HTTrack – Website Downloader Copier & Site Ripper Download

>httrack --help

HTTrack version 3.03BETAo4 (compiled Jul  1 2001)

usage: ./httrack <URLs> [-option] [+<FILTERs>] [-<FILTERs>]

with options listed below: (* is the default value)

General options:

  O  path for mirror/logfiles+cache (-O path_mirror[,path_cache_and_logfiles]) (--path <param>)

%O  top path if no path defined (-O path_mirror[,path_cache_and_logfiles])

Action options:

  w *mirror web sites (--mirror)

  W  mirror web sites, semi-automatic (asks questions) (--mirror-wizard)

  g  just get files (saved in the current directory) (--get-files)

  i  continue an interrupted mirror using the cache

  Y   mirror ALL links located in the first level pages (mirror links) (--mirrorlinks)
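
As a rough illustration of how the general and action options combine, here is a minimal sketch of a basic mirror command. The helper name `mirror_cmd`, the URL and the output directory are all placeholders invented for this example; the function only prints the command line so it can be reviewed before anything is downloaded.

```shell
# Hypothetical helper: prints a basic httrack mirror command without running it.
# $1 = start URL, $2 = output directory (both placeholders chosen by the caller).
mirror_cmd() {
  # -O sets the path for the mirror, log files and cache
  # -w is the default "mirror web sites" action, spelled out here for clarity
  printf 'httrack %s -O %s -w\n' "$1" "$2"
}

mirror_cmd "http://www.example.com/" "/tmp/example-mirror"
```

Once the printed line looks right, it can be pasted into a terminal to start the actual mirror.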

Proxy options:

  P  proxy use (-P proxy:port or -P user:pass@proxy:port) (--proxy <param>)

%f *use proxy for ftp (f0 don't use) (--httpproxy-ftp[=N])

Limits options:

  rN set the mirror depth to N (* r9999) (--depth[=N])

%eN set the external links depth to N (* %e0) (--ext-depth[=N])

  mN maximum file length for a non-html file (--max-files[=N])

  mN,N'                  for non html (N) and html (N')

  MN maximum overall size that can be uploaded/scanned (--max-size[=N])

  EN maximum mirror time in seconds (60=1 minute, 3600=1 hour) (--max-time[=N])

  AN maximum transfer rate in bytes/seconds (1000=1kb/s max) (--max-rate[=N])

%cN maximum number of connections/seconds (*%c10)

  GN pause transfer if N bytes reached, and wait until lock file is deleted (--max-pause[=N])

Flow control:

  cN number of multiple connections (*c8) (--sockets[=N])

  TN timeout, number of seconds after a non-responding link is shut down (--timeout)

  RN number of retries, in case of timeout or non-fatal errors (*R1) (--retries[=N])

  JN traffic jam control, minimum transfer rate (bytes/seconds) tolerated for a link (--min-rate[=N])

  HN host is abandoned if: 0=never, 1=timeout, 2=slow, 3=timeout or slow (--host-control[=N])
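
The limits and flow-control options above are the ones that keep a mirror from hammering a server. The sketch below shows one possible combination. The specific values, the helper name `polite_mirror_cmd`, the URL and the output directory are assumptions picked for illustration, not recommendations from the HTTrack help text. The function prints the command rather than running it.

```shell
# Hypothetical helper: prints a rate-limited httrack command line without running it.
# $1 = start URL, $2 = output directory (placeholders supplied by the caller).
polite_mirror_cmd() {
  # -r3      mirror depth of 3
  # -E3600   stop after at most 3600 seconds (1 hour)
  # -A25000  transfer at most 25000 bytes/second
  # -%c2     open at most 2 connections per second
  # -c4      use 4 multiple connections
  # -T30     time out a non-responding link after 30 seconds
  # -R2      retry twice on timeout or non-fatal errors
  printf '%s\n' "httrack $1 -O $2 -r3 -E3600 -A25000 -%c2 -c4 -T30 -R2"
}

polite_mirror_cmd "http://www.example.com/" "/tmp/example-mirror"
```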

Links options:

%P *extended parsing, attempt to parse all links, even in unknown tags or Javascript (%P0 don't use) (--extended-parsing[=N])

  n  get non-html files 'near' an html file (ex: an image located outside) (--near)

  t  test all URLs (even forbidden ones) (--test)

%L <file> add all URL located in this text file (one URL per line) (--list <param>)

Build options:

  NN structure type (0 *original structure, 1+: see below) (--structure[=N])

     or user defined structure (-N "%h%p/%n%q.%t")

  LN long names (L1 *long names / L0 8-3 conversion) (--long-names[=N])

  KN keep original links (e.g. http://www.adr/link) (K0 *relative link, K absolute links, K3 absolute URI links) (--keep-links[=N])

  x  replace external html links by error pages (--replace-external)

%x  do not include any password for external password protected websites (%x0 include) (nopasswords)

%q *include query string for local files (useless, for information purpose only) (%q0 don't include) (--include-query-string)

  o *generate output html file in case of error (404..) (o0 don't generate) (--generate-errors)

  X *purge old files after update (X0 keep delete) (--purge-old[=N])

Spider options:

  bN accept cookies in cookies.txt (0=do not accept,* 1=accept) (--cookies[=N])

  u  check document type if unknown (cgi,asp..) (u0 don't check, * u1 check but /, u2 check always) (--check-type[=N])

  j *parse Java Classes (j0 don't parse) (--parse-java[=N])

  sN follow robots.txt and meta robots tags (0=never,1=sometimes,* 2=always) (--robots[=N])

%h  force HTTP/1.0 requests (reduce update features, only for old servers or proxies) (--http-10)

%B  tolerant requests (accept bogus responses on some servers, but not standard!) (--tolerant)

%s  update hacks: various hacks to limit retransfers when updating (identical size, bogus response..) (--updatehack)

%A  assume that a type (cgi,asp..) is always linked with a mime type (-%A php3=text/html) (--assume <param>)

Browser ID:

  F  user-agent field (-F "user-agent name") (--user-agent <param>)

%F  footer string in Html code (-%F "Mirrored [from host %s [file %s [at %s]]]") (--footer <param>)

%l  preferred language (-%l "fr, en, jp, *") (--language <param>)

Log, index, cache

  C  create/use a cache for updates and retries (C0 no cache,C1 cache has priority,* C2 test update before) (--cache[=N])

  k  store all files in cache (not useful if files on disk) (--store-all-in-cache)

%n  do not re-download locally erased files (--do-not-recatch)

%v  display on screen filenames downloaded (in realtime) (--display)

  Q  no log - quiet mode (--do-not-log)

  q  no questions - quiet mode (--quiet)

  z  log - extra infos (--extra-log)

  Z  log - debug (--debug-log)

  v  log on screen (--verbose)

  f *log in files (--file-log)

  f2 one single log file (--single-log)

  I *make an index (I0 don't make) (--index)

%I  make a searchable index for this mirror (* %I0 don't make) (--search-index)

Expert options:

  pN priority mode: (* p3) (--priority[=N])

      0 just scan, don't save anything (for checking links)

      1 save only html files

      2 save only non html files

     *3 save all files

      7 get html files before, then treat other files

  S  stay on the same directory

  D *can only go down into subdirs

  U  can only go to upper directories

  B  can both go up&down into the directory structure

  a *stay on the same address

  d  stay on the same principal domain

  l  stay on the same TLD (eg: .com)

  e  go everywhere on the web

%H  debug HTTP headers in logfile (--debug-headers)
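
The priority and scope letters listed under the expert options can be combined into a link check that saves nothing. This is a minimal sketch under assumptions: the helper name `link_check_cmd`, the URL, the output directory and the depth of 2 are placeholders, and the function only prints the command so it can be inspected first.

```shell
# Hypothetical helper: prints an httrack link-check command without running it.
# $1 = start URL, $2 = directory for logs and cache (placeholders supplied by the caller).
link_check_cmd() {
  # -p0  just scan, don't save anything (for checking links)
  # -d   stay on the same principal domain
  # -r2  mirror depth of 2
  printf '%s\n' "httrack $1 -O $2 -p0 -d -r2"
}

link_check_cmd "http://www.example.com/" "/tmp/example-linkcheck"
```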

Guru options: (do NOT use)

#0  Filter test (-#0 '*.gif' 'www.bar.com/foo.gif')

#f  Always flush log files

#FN Maximum number of filters

#h  Version info

#K  Scan stdin (debug)

#L  Maximum number of links (-#L1000000)

#p  Display ugly progress information

#P  Catch URL

#R  Old FTP routines (debug)

#T  Generate transfer ops. log every minute

#u  Wait time

#Z  Generate transfer rate statistics every minute

#!  Execute a shell command (-#! "echo hello")

Command-line specific options:

  V execute system command after each file ($0 is the filename: -V "rm \$0") (--userdef-cmd <param>)

%U run the engine with another id when called as root (-%U smith) (--user <param>)

Details: Option N

  N0 Site-structure (default)

  N1 HTML in web/, images/other files in web/images/

  N2 HTML in web/HTML, images/other in web/images

  N3 HTML in web/,  images/other in web/

  N4 HTML in web/, images/other in web/xxx, where xxx is the file extension

(all gif will be placed onto web/gif, for example)

  N5 Images/other in web/xxx and HTML in web/HTML

  N99 All files in web/, with random names (gadget !)

  N100 Site-structure, without www.domain.xxx/

  N101 Identical to N1 except that "web" is replaced by the site's name

  N102 Identical to N2 except that "web" is replaced by the site's name

  N103 Identical to N3 except that "web" is replaced by the site's name

  N104 Identical to N4 except that "web" is replaced by the site's name

  N105 Identical to N5 except that "web" is replaced by the site's name

  N199 Identical to N99 except that "web" is replaced by the site's name

  N1001 Identical to N1 except that there is no "web" directory

  N1002 Identical to N2 except that there is no "web" directory

  N1003 Identical to N3 except that there is no "web" directory (option set for g option)

  N1004 Identical to N4 except that there is no "web" directory

  N1005 Identical to N5 except that there is no "web" directory

  N1099 Identical to N99 except that there is no "web" directory

Details: User-defined option N

  %n Name of file without file type (ex: image)

  %N Name of file, including file type (ex: image.gif)

  %t File type (ex: gif)

  %p Path [without ending /] (ex: /someimages)

  %h Host name (ex: www.someweb.com)

  %M URL MD5 (128 bits, 32 ascii bytes)

  %Q query string MD5 (128 bits, 32 ascii bytes)

  %q small query string MD5 (16 bits, 4 ascii bytes)

     %s? Short name version (ex: %sN)

  %[param] param variable in query string
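
To see what a user-defined structure string produces, it helps to substitute the placeholders by hand. The sketch below does that with `sed` for four of the placeholders (%h, %p, %n, %t), using the example values from the list above. It is only an illustration of the substitution idea: `expand_structure` is a made-up helper, not part of HTTrack, and it ignores %N, %M, %Q, %q, %s? and %[param].

```shell
# Hypothetical helper: expands a structure pattern the way the placeholder list describes.
# $1 = pattern, $2 = host (%h), $3 = path without ending / (%p),
# $4 = file name without type (%n), $5 = file type (%t)
expand_structure() {
  printf '%s\n' "$1" | sed \
    -e "s|%h|$2|g" \
    -e "s|%p|$3|g" \
    -e "s|%n|$4|g" \
    -e "s|%t|$5|g"
}

# The pattern would be passed to httrack as: -N "%h%p/%n.%t"
expand_structure '%h%p/%n.%t' 'www.someweb.com' '/someimages' 'image' 'gif'
# prints: www.someweb.com/someimages/image.gif
```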

Shortcuts:

mirror      

Source: DarkNet

Founder and Editor-in-Chief of 'Professional Hackers India'. Technology Evangelist, Security Analyst, Cyber Security Expert, PHP Developer and Part time hacker.