Most of these scripts produce output when run with a single file
argument, where the file contains some HTML. The 'h*sub' scripts
(htextsub and hrefsub) take two arguments: the first is a Perl
expression and the second is an HTML file. Every script also has an
explanatory comment at the top.

For example, try running:

    lynx -dump -source -raw http://www.debian.org > /tmp/a.txt
    ./hanchors /tmp/a.txt

Of course, if http://www.debian.org is not your favourite web site, you
can make the appropriate substitution.

hanchors  - List all anchors in the HTML
hlc       - Correct any upper case tags to lower case
hstrip    - Remove deprecated scripting and styling tags and attributes
htextsub  - Apply arbitrary Perl expression to all text within HTML
hrefsub   - Apply arbitrary Perl expression to all hrefs within HTML
htitle    - Print title of the HTML document
hdump     - Output event information whilst parsing HTML document
hform     - Print analysis of form controls present in HTML
htext     - Print all the text from the HTML
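
To give a feel for the kind of work a script such as hanchors does, here
is a rough sketch in Python rather than Perl. It is not the hanchors
source and may not match its output format; it only assumes that
"listing anchors" means collecting the href value of each <a> tag. The
names AnchorLister and list_anchors are made up for this illustration.

```python
from html.parser import HTMLParser


class AnchorLister(HTMLParser):
    """Collect href values from <a> tags (hypothetical helper, not hanchors)."""

    def __init__(self):
        super().__init__()
        self.anchors = []

    def handle_starttag(self, tag, attrs):
        # HTMLParser passes tag and attribute names already lower-cased.
        if tag != "a":
            return
        for name, value in attrs:
            if name == "href" and value is not None:
                self.anchors.append(value)


def list_anchors(html_text):
    """Return the href of every <a> tag in html_text, in document order."""
    parser = AnchorLister()
    parser.feed(html_text)
    parser.close()
    return parser.anchors


# Small inline sample so the sketch runs without any input file.
sample = '<p><A HREF="http://www.debian.org/">Debian</A> <a name="top">top</a></p>'
for href in list_anchors(sample):
    print(href)
```

The event-driven shape (a callback per start tag) mirrors how a
streaming HTML parser is normally used: the document is never loaded
into a tree, so the same approach works on large files.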