php中Snoopy类用法实例

2024-05-04 23:36:37

字体：大中小

来源：转载

供稿：网友

这篇文章主要介绍了php中Snoopy类用法,实例分析了使用Snoopy类实现页面抓取的相关技巧,需要的朋友可以参考下

本文实例讲述了php中Snoopy类用法。分享给大家供大家参考。具体分析如下：

这里演示了php中如何通过Snoopy抓取网页信息

snoopy类的下载地址：http://sourceforge.net/projects/snoopy/

/*

You need the snoopy.class.php from

http://snoopy.sourceforge.net/

*/

include("snoopy.class.php");

$snoopy = new Snoopy;

// need an proxy?:

//$snoopy->proxy_host = "my.proxy.host";

//$snoopy->proxy_port = "8080";

// set browser and referer:

$snoopy->agent = "Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1)";

$snoopy->referer = "http://www.jonasjohn.de/";

// set some cookies:

$snoopy->cookies["SessionID"] = '238472834723489';

$snoopy->cookies["favoriteColor"] = "blue";

// set an raw-header:

$snoopy->rawheaders["Pragma"] = "no-cache";

// set some internal variables:

$snoopy->maxredirs = 2;

$snoopy->offsiteok = false;

$snoopy->expandlinks = false;

// set username and password (optional)

//$snoopy->user = "joe";

//$snoopy->pass = "bloe";

// fetch the text of the website www.google.com:

if($snoopy->fetchtext("http://www.google.com")){

// other methods: fetch, fetchform, fetchlinks, submittext and submitlinks

// response code:

print "response code: ".$snoopy->response_code." /n";

// print the headers:

print "Headers: ";

while(list($key,$val) = each($snoopy->headers)){

print $key.": ".$val." /n";

}

print " /n";

// print the texts of the website:

print "<pre>".htmlspecialchars($snoopy->results)."</pre>/n";

}

else {

print "Snoopy: error while fetching document: ".$snoopy->error."/n";

}