<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	>
<channel>
	<title>Comments for screen-scrapeable</title>
	<atom:link href="http://blog.screen-scraper.com/comments/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.screen-scraper.com</link>
	<description>Thoughts, tips, and updates on screen-scraping</description>
	<pubDate>Sat, 04 Feb 2012 09:58:36 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.5.1</generator>
		<item>
		<title>Comment on How to surf and screen-scrape anonymously by J.K.</title>
		<link>http://blog.screen-scraper.com/2007/03/01/how-to-surf-and-screen-scrape-anonymously/#comment-192349</link>
		<dc:creator>J.K.</dc:creator>
		<pubDate>Tue, 06 Dec 2011 19:38:24 +0000</pubDate>
		<guid isPermaLink="false">http://blog.screen-scraper.com/2007/03/01/how-to-surf-and-screen-scrape-anonymously/#comment-192349</guid>
		<description>Anonymizers  are great, but you traffic can still be read. A VPN is better in this regard, it will encrypt everything that leaves your computer. I use a OpenVPN service that encrypts with AES and 256 bits keys.  Site: http://www.sunvpn.com/.</description>
		<content:encoded><![CDATA[<p>Anonymizers  are great, but you traffic can still be read. A VPN is better in this regard, it will encrypt everything that leaves your computer. I use a OpenVPN service that encrypts with AES and 256 bits keys.  Site: <a href="http://www.sunvpn.com/" rel="nofollow">http://www.sunvpn.com/</a>.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Techniques for Scraping Large Datasets by Stu</title>
		<link>http://blog.screen-scraper.com/2008/07/07/large-data/#comment-185869</link>
		<dc:creator>Stu</dc:creator>
		<pubDate>Mon, 12 Sep 2011 17:24:26 +0000</pubDate>
		<guid isPermaLink="false">http://blog.screen-scraper.com/2008/07/07/large-data/#comment-185869</guid>
		<description>I figured out there was something wrong with the "for (i=0;" portion because there was no closing bracket in your example, BUT being a sysadmin and not a programmer makes life a little difficult sometimes.  Hopefully the posted information will assist someone else with the head scratching....  Thanks Jason.</description>
		<content:encoded><![CDATA[<p>I figured out there was something wrong with the &#8220;for (i=0;&#8221; portion because there was no closing bracket in your example, BUT being a sysadmin and not a programmer makes life a little difficult sometimes.  Hopefully the posted information will assist someone else with the head scratching&#8230;.  Thanks Jason.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Techniques for Scraping Large Datasets by jason</title>
		<link>http://blog.screen-scraper.com/2008/07/07/large-data/#comment-185860</link>
		<dc:creator>jason</dc:creator>
		<pubDate>Mon, 12 Sep 2011 15:05:04 +0000</pubDate>
		<guid isPermaLink="false">http://blog.screen-scraper.com/2008/07/07/large-data/#comment-185860</guid>
		<description>Stu,

You seem to be missing a bit. Look at this:

&lt;code&gt;/*

Script gets all page numbers from the Pages extractor pattern, and iterates through them

*/

// Get variable
pages = Integer.parseInt(session.getVariable(”Pages”));

// Clear session variable so it doesn’t linger
session.setVariable(”Pages”, null);

// Loop through pages
for (i=0; i&gt;pages; i++)
{
	// Since the page list appears twice, use only a number larger than that just used
	if (i&gt;session.getVariable(”PAGE”))
	{
		session.setVariable(”PAGE”, i);
		session.log(”+++Scraping page #” + i);
		session.scrapeFile(”Next search results”);
	}
	else
	{
		session.log(”+++Already have page #” + i + ” so not scraping”);
	}
}
&lt;/code&gt;</description>
		<content:encoded><![CDATA[<p>Stu,</p>
<p>You seem to be missing a bit. Look at this:</p>
<p><code>/*</p>
<p>Script gets all page numbers from the Pages extractor pattern, and iterates through them</p>
<p>*/</p>
<p>// Get variable<br />
pages = Integer.parseInt(session.getVariable(”Pages”));</p>
<p>// Clear session variable so it doesn’t linger<br />
session.setVariable(”Pages”, null);</p>
<p>// Loop through pages<br />
for (i=0; i>pages; i++)<br />
{<br />
	// Since the page list appears twice, use only a number larger than that just used<br />
	if (i>session.getVariable(”PAGE”))<br />
	{<br />
		session.setVariable(”PAGE”, i);<br />
		session.log(”+++Scraping page #” + i);<br />
		session.scrapeFile(”Next search results”);<br />
	}<br />
	else<br />
	{<br />
		session.log(”+++Already have page #” + i + ” so not scraping”);<br />
	}<br />
}<br />
</code></p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Techniques for Scraping Large Datasets by Stu</title>
		<link>http://blog.screen-scraper.com/2008/07/07/large-data/#comment-185855</link>
		<dc:creator>Stu</dc:creator>
		<pubDate>Mon, 12 Sep 2011 13:47:07 +0000</pubDate>
		<guid isPermaLink="false">http://blog.screen-scraper.com/2008/07/07/large-data/#comment-185855</guid>
		<description>OK, Figured out the above problem ””, Replace ” with " if you cut and paste the above, BUT I have been presented with another...  argh!!  An error occurred while processing the script: The error message was: class bsh.ParseException (line 23): if-- Encountered "if" at line 23, column 5. 

Line 23 reads: if (i&#62;session.getVariable("PAGE"))

I do hope all this pain will help someone else....  If anyone can shed some light on the "New" problem, it would be appreciated..</description>
		<content:encoded><![CDATA[<p>OK, Figured out the above problem ””, Replace ” with &#8221; if you cut and paste the above, BUT I have been presented with another&#8230;  argh!!  An error occurred while processing the script: The error message was: class bsh.ParseException (line 23): if&#8211; Encountered &#8220;if&#8221; at line 23, column 5. </p>
<p>Line 23 reads: if (i&gt;session.getVariable(&#8221;PAGE&#8221;))</p>
<p>I do hope all this pain will help someone else&#8230;.  If anyone can shed some light on the &#8220;New&#8221; problem, it would be appreciated..</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Techniques for Scraping Large Datasets by Stu</title>
		<link>http://blog.screen-scraper.com/2008/07/07/large-data/#comment-185847</link>
		<dc:creator>Stu</dc:creator>
		<pubDate>Mon, 12 Sep 2011 11:04:11 +0000</pubDate>
		<guid isPermaLink="false">http://blog.screen-scraper.com/2008/07/07/large-data/#comment-185847</guid>
		<description>I invoke the first script "When I get the to the search results page, the first extractor runs as always and drills into the details of each result as usual. The second extractor pattern grabs all the pages listed so I get a dataSet named “Pages,” containing links to pages 2 through 20, and I save the dataSet as a session variable. On the scripts tab, I then run this script after the file is scraped:"

.... da da da Script Runs with result: The error message was: class bsh.EvalError (line 9): session .getVariable ( Pages ) -- Error in method invocation: Attempt to pass void argument (position 0) to method: getVariable

Line Nine reads "pages = session.getVariable(”Pages”);"

On my Search results page I extract "~@PAGE@~", "~@Pages@~" and "~@Next@~" all are stored as a Session.  My gut is telling me that the Session Requested "session.getVariable(”Pages”);" is empty.</description>
		<content:encoded><![CDATA[<p>I invoke the first script &#8220;When I get the to the search results page, the first extractor runs as always and drills into the details of each result as usual. The second extractor pattern grabs all the pages listed so I get a dataSet named “Pages,” containing links to pages 2 through 20, and I save the dataSet as a session variable. On the scripts tab, I then run this script after the file is scraped:&#8221;</p>
<p>&#8230;. da da da Script Runs with result: The error message was: class bsh.EvalError (line 9): session .getVariable ( Pages ) &#8212; Error in method invocation: Attempt to pass void argument (position 0) to method: getVariable</p>
<p>Line Nine reads &#8220;pages = session.getVariable(”Pages”);&#8221;</p>
<p>On my Search results page I extract &#8220;~@PAGE@~&#8221;, &#8220;~@Pages@~&#8221; and &#8220;~@Next@~&#8221; all are stored as a Session.  My gut is telling me that the Session Requested &#8220;session.getVariable(”Pages”);&#8221; is empty.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Screen-Scraping for iPhone, Andriod, Blackberry, and Most Any Other Mobile Device by Todd Wilson</title>
		<link>http://blog.screen-scraper.com/2011/03/14/screen-scraping-for-iphone-andriod-blackberry-and-most-any-other-mobile-device/#comment-182962</link>
		<dc:creator>Todd Wilson</dc:creator>
		<pubDate>Mon, 15 Aug 2011 14:47:44 +0000</pubDate>
		<guid isPermaLink="false">http://blog.screen-scraper.com/?p=139#comment-182962</guid>
		<description>You can actually think of screen-scraper much like you would a web or database server in this respect.  Just as a web server can handle many requests at once, screen-scraper can run multiple scrapes simultaneously.  But also like a web server, screen-scraper can get overwhelmed if it's asked to run too many scrapes at once.  As this occurs it would probably make sense to either add more screen-scraper instances on the current machine, or add more machines running screen-scraper.</description>
		<content:encoded><![CDATA[<p>You can actually think of screen-scraper much like you would a web or database server in this respect.  Just as a web server can handle many requests at once, screen-scraper can run multiple scrapes simultaneously.  But also like a web server, screen-scraper can get overwhelmed if it&#8217;s asked to run too many scrapes at once.  As this occurs it would probably make sense to either add more screen-scraper instances on the current machine, or add more machines running screen-scraper.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Screen-Scraping for iPhone, Andriod, Blackberry, and Most Any Other Mobile Device by smol84</title>
		<link>http://blog.screen-scraper.com/2011/03/14/screen-scraping-for-iphone-andriod-blackberry-and-most-any-other-mobile-device/#comment-182833</link>
		<dc:creator>smol84</dc:creator>
		<pubDate>Sat, 13 Aug 2011 22:56:44 +0000</pubDate>
		<guid isPermaLink="false">http://blog.screen-scraper.com/?p=139#comment-182833</guid>
		<description>great article and very helpful blog but I am curious as to how you would configure the screen scraper to run multiple instance at once (or multiple screen scraper threads). Is it best to have it across multiple computers (similar to distributed computing) or is it best to have one computer running multiple screen scrape threads or sessions, so that when one session is not scraping another can operate. What's the best way to approach this?</description>
		<content:encoded><![CDATA[<p>great article and very helpful blog but I am curious as to how you would configure the screen scraper to run multiple instance at once (or multiple screen scraper threads). Is it best to have it across multiple computers (similar to distributed computing) or is it best to have one computer running multiple screen scrape threads or sessions, so that when one session is not scraping another can operate. What&#8217;s the best way to approach this?</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Scraping CAPTCHA forms (you know, those HTML forms with the wavy text) by scottw</title>
		<link>http://blog.screen-scraper.com/2006/10/18/scraping-captcha-forms-you-know-those-html-forms-with-the-wavy-text/#comment-171957</link>
		<dc:creator>scottw</dc:creator>
		<pubDate>Tue, 17 May 2011 23:49:22 +0000</pubDate>
		<guid isPermaLink="false">http://blog.screen-scraper.com/2006/10/18/scraping-captcha-forms-you-know-those-html-forms-with-the-wavy-text/#comment-171957</guid>
		<description>Just added a sample scraping session that downloads CAPTCHA image from Google's recaptcha.com, passes image to decaptcher.com service and receives response as TEXT.

Check it out &lt;a href="http://community.screen-scraper.com/node/1838" rel="nofollow"&gt;here&lt;/a&gt;.</description>
		<content:encoded><![CDATA[<p>Just added a sample scraping session that downloads CAPTCHA image from Google&#8217;s recaptcha.com, passes image to decaptcher.com service and receives response as TEXT.</p>
<p>Check it out <a href="http://community.screen-scraper.com/node/1838" rel="nofollow">here</a>.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Version 5.5 of screen-scraper Released! by Josh</title>
		<link>http://blog.screen-scraper.com/2011/04/01/version-55-of-screen-scraper-released/#comment-169135</link>
		<dc:creator>Josh</dc:creator>
		<pubDate>Tue, 19 Apr 2011 19:14:54 +0000</pubDate>
		<guid isPermaLink="false">http://blog.screen-scraper.com/?p=144#comment-169135</guid>
		<description>A size-customizable box would be great!

Another useful customizable option would be the regarding how the regular expressions are now automatically filled in based on where you put a pattern token. In general I find this feature very useful. But in some cases I need to manually remove the regex that is automatically applied.

Thanks for the wrappable text in the last request/response windows. I also like how the sub extractor Test Pattern returns only its relevant data!</description>
		<content:encoded><![CDATA[<p>A size-customizable box would be great!</p>
<p>Another useful customizable option would be the regarding how the regular expressions are now automatically filled in based on where you put a pattern token. In general I find this feature very useful. But in some cases I need to manually remove the regex that is automatically applied.</p>
<p>Thanks for the wrappable text in the last request/response windows. I also like how the sub extractor Test Pattern returns only its relevant data!</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Version 5.5 of screen-scraper Released! by Todd Wilson</title>
		<link>http://blog.screen-scraper.com/2011/04/01/version-55-of-screen-scraper-released/#comment-169133</link>
		<dc:creator>Todd Wilson</dc:creator>
		<pubDate>Tue, 19 Apr 2011 18:51:54 +0000</pubDate>
		<guid isPermaLink="false">http://blog.screen-scraper.com/?p=144#comment-169133</guid>
		<description>Thanks for the feedback, Josh.  We took an internal poll here which concluded that the sub-extractor patterns box should be shrunk.  The vast majority of our sub-extractor patterns tend to be one-liners, so the large box ended up just consuming unnecessary space.  That said, your point is well-taken; perhaps we could add a setting that allows the size of that box to be adjusted according to preference.  I'll add it to our to-do list!</description>
		<content:encoded><![CDATA[<p>Thanks for the feedback, Josh.  We took an internal poll here which concluded that the sub-extractor patterns box should be shrunk.  The vast majority of our sub-extractor patterns tend to be one-liners, so the large box ended up just consuming unnecessary space.  That said, your point is well-taken; perhaps we could add a setting that allows the size of that box to be adjusted according to preference.  I&#8217;ll add it to our to-do list!</p>
]]></content:encoded>
	</item>
</channel>
</rss>

