how to do web scrapping

Can anyone advice which language and which software to use to write a basic web scrapper? Also how involved is this work? Have no prior experience with this.

My goal is to copy and paste specific information from some websites every 1/5 minutes to an excel sheet. Thanks.
 
Quote from gmst:

Can anyone advice which language and which software to use to write a basic web scrapper? Also how involved is this work? Have no prior experience with this.

My goal is to copy and paste specific information from some websites every 1/5 minutes to an excel sheet. Thanks.


have you tried "import data from websource" on excel?
 

Attachments

Quote from gmst:

Can anyone advice which language and which software to use to write a basic web scrapper? Also how involved is this work? Have no prior experience with this.

My goal is to copy and paste specific information from some websites every 1/5 minutes to an excel sheet. Thanks.

maybe try ubot
 
Quote from Eyez:

have you tried "import data from websource" on excel?

Ah I missed it. Thanks for pointing :)

I will give it a try and report back my experience.
 
Quote from gmst:

Ah I missed it. Thanks for pointing :)

I will give it a try and report back my experience.



np, i use it to pull treasury rates from yahoo finance to price out the curve
 
Quote from HurricaneUS:

maybe try ubot

Thanks never heard of it before :)

Standard edition costs 245$. Will see if excel web query can do the job.
 
If you use C# .Net it's only 3 lines to download HTML from a URL into a string. Then you can parse it any way you want, and if you know anything about parsing, mostly it will be easy stuff to code. If it's not HTML then it's harder.

Another advantage is also you can iterate through many URLs easily, in case not all info is on a single page.

Here's code for VB:

Code:
Dim client As New System.Net.WebClient
        Dim html As String
        html = client.DownloadString("http://www.bloomberg.com/quote/SPY:US")
 
Quote from gmst:

Can anyone advice which language and which software to use to write a basic web scrapper? Also how involved is this work? Have no prior experience with this.

My goal is to copy and paste specific information from some websites every 1/5 minutes to an excel sheet. Thanks.


Ive done some work scraping specific data for people in the bond markets.
One useful (though quirky) product is djuggler
www.djuggler.com

After much tinkering around with this and other products - I finally decided its easier (for me) in the long run to simply use the C# APIs and write my own code (fewer things to learn-and relearn)
 
Quote from braincell:

If you use C# .Net it's only 3 lines to download HTML from a URL into a string. Then you can parse it any way you want, and if you know anything about parsing, mostly it will be easy stuff to code. If it's not HTML then it's harder.

Another advantage is also you can iterate through many URLs easily, in case not all info is on a single page.

Here's code for VB:

Code:
Dim client As New System.Net.WebClient
        Dim html As String
        html = client.DownloadString("http://www.bloomberg.com/quote/SPY:US")

Thanks, but I haven't used C#. I can do VB though. I also have never done any parsing before. I will see if I can get some resource on net to learn it.

But the message I am getting from various posters is that it is not something that will take too much time to learn and implement.
 
Quote from gmst:

Thanks, but I haven't used C#. I can do VB though. I also have never done any parsing before. I will see if I can get some resource on net to learn it.

But the message I am getting from various posters is that it is not something that will take too much time to learn and implement.


ahhhh.... just wait until you encounter the vast number of different ways web sites are implemented
:)
 
Back
Top