急!C#利用HttpWebRequest怎么获取远程html代码

2025-05-08 11:28:36
推荐回答(1个)
回答1:

以前写的一个方法:
//根据Url地址得到网页完整的HTML源码
public string GetWebContent(string Url, string Encod)
{
string srcString = "";
Uri Urled = new Uri(Url);
HttpWebRequest request = (HttpWebRequest)WebRequest.Create(Urled);
request.Timeout = 30000;
request.Headers.Set("Pragma", "no-cache");
HttpWebResponse response = (HttpWebResponse)request.GetResponse();
Stream streamReceive = response.GetResponseStream();
if (response.ContentEncoding.ToLower() == "gzip")
{//针对IIS压缩的页面
streamReceive = new GZipStream(streamReceive, CompressionMode.Decompress);
}
Encoding encoding = Encoding.GetEncoding(Encod);
StreamReader streamReader = new StreamReader(streamReceive, encoding);
srcString = streamReader.ReadToEnd();
return srcString;
}
----
补充:
那你先抓取js文件。然后正则分析出那个地址再抓啊。。。。反正首先你需要知道一个url,然后层层的剥。。。。