Nmap Development mailing list archives
Re: html page extensions
From: Michael Pattrick <mpattrick () rhinovirus org>
Date: Mon, 14 Sep 2009 00:47:07 -0400
Don't forget about aspx and jhtml. However, wouldn't it just be easier to check mime types? Many sites use non-standard or seemingly random extensions: webct uses .dowebct, sometimes .do is used for cgi scripts such as 'http://www.ic.gc.ca/app/opic-cipo/trdmrks/srch/tmSrch.do?lang=eng'. I suppose that it requires more overhead, as you would have to download the header of every link as opposed to just specific ones... -M On Sun, Sep 13, 2009 at 11:48 PM, Patrick Donnelly <batrick () batbytes com> wrote:
Hi nmap-dev, I'm working on an http spider script and need to know what file extensions are common for html pages. Here's a list I have so far (in Lua regular expressions): local html_page_extensions = { "%.html$", -- regular html page "%.htm$", -- regular html page "%.shtml$", -- regular html page "%.phtml$", -- regular html page "%.php$", -- php "%.pl$", -- perl "%.cgi$", -- cgi "%.jsp$", -- Java Server Pages "%.asp$", -- Active Server Pages (Microsoft) }; I'm also checking pages that have no extension (as that is apparently very common). Does anyone have more to add? -- -Patrick Donnelly "Let all men know thee, but no man know thee thoroughly: Men freely ford that see the shallows." - Benjamin Franklin _______________________________________________ Sent through the nmap-dev mailing list http://cgi.insecure.org/mailman/listinfo/nmap-dev Archived at http://SecLists.Org
_______________________________________________ Sent through the nmap-dev mailing list http://cgi.insecure.org/mailman/listinfo/nmap-dev Archived at http://SecLists.Org
Current thread:
- html page extensions Patrick Donnelly (Sep 13)
- Re: html page extensions Michael Pattrick (Sep 13)
- Re: html page extensions jah (Sep 14)