Nmap Development mailing list archives

[NSE] HTML parser library and cheers


From: Giacomo Mantani <giacomo.mantani () studio unibo it>
Date: Sat, 12 Mar 2016 23:14:20 +0000

Hi all,

I am Giacomo Mantani, a master degree student at University of Bologna.
I am testing my scripting skills before writing an application for this SoC.

In soc () nmap org list I have exchanged an email with Daniel Miller about the HTML
parser library and using LPeg seems a good choice. I have preferred the
SNOBOL-like modus operandi, that allow us to define patterns incrementally.
I saw that Patrick (nmap/todo/patrick.txt) is working on LPeg ATM, maybe we
could collaborate.

I have already made a skeleton https://gist.github.com/jak3/f34d686e7c8965dd7532
that catches tag "title", HTML and Javascript comments (case insensitive).
I do not apply and honouring your DocBook XML comment standard yet but if
the script will be official I will.

These are the NSE scripts that already could benefit from it:

* http-title
* http-traceroute, extract_title (regex is really weak, case-sensitive)
* http-comments-displayer

Further improvements could be made implementing functionalities that are used
by:

* http.lua
* httpspider
* http-favicon
* http-affiliate-id

I am more interested in Vulnerability and exploitation specialist role, but
where/what to code is pretty much the same.

Let me know your thoughts and if there are incoming high priority scripts that
could be developed in the meantime.

Thanks in advance,
Giacomo
_______________________________________________
Sent through the dev mailing list
https://nmap.org/mailman/listinfo/dev
Archived at http://seclists.org/nmap-dev/


Current thread: