Jump to content

links collector


colore

Recommended Posts

hello

1)

is there a way to grab and store (in a txt file) all the "links" or "urls in the text" of all the webpages I visit, that contain a specific string eg urls like www.*.com/*.pdf ?

the program must scan the text and links of all the webpages I visit and if it finds an url of the above mask, it should store it (in a text file)

2)

I would like a program that will store (in a text file) the urls of the webpages I visit that match a specific mask eg www.google.com/*

thanks!

Link to comment
Share on other sites


I dont know of a program that can do that, i have an idea though.

Basically convert the Firefox file history.dat into a txt file, import the list into a spreadsheet app like Excel and do the filtering in that. Easier said than done though.

Currently Firefox 2.x uses the "mork" format to store its history, this will be converted to SQLite in Firefox 3.x which you can download alpha releases of.

To convert the 'mork' history.dat into txt see this page, there is a bookmarklet (which did not work for me) and program called 'Dork' in the comments that DID work for me.

http://philwilson.org/blog/2005/01/how-to-...ry-to-text.html

You may want to try the Firefox 3 alphas because it should theoretically be very easy to do it with this, you just need an sqlite parser.

http://www.squarefree.com/burningedge/

For Internet Explorer here is an article, did not work for me though:

http://mcpmag.com/columns/article.asp?EditorialsID=1595

hope this helps a little, if you find something better let me know. :)

Link to comment
Share on other sites

at that page I liked to at philwilson.org it states there is a javascript available that converts the Mork history.dat into RDF/XML at this bugzilla page.

https://bugzilla.mozilla.org/show_bug.cgi?id=241438

In fact at that page there are 3 attachments, one for javascript, one for python, and one to convert to tab delimited.

Link to comment
Share on other sites

Create an account or sign in to comment

You need to be a member in order to leave a comment

Create an account

Sign up for a new account in our community. It's easy!

Register a new account

Sign in

Already have an account? Sign in here.

Sign In Now
  • Recently Browsing   0 members

    • No registered users viewing this page.
×
×
  • Create New...