How to fetch an image from a web page with HttpClient

Problem description

A web page contains the following markup, which displays an image: <img border="0" Align="center" src="image.do"/>

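How can I fetch this image with the HttpClient class? I tried the approach below, but the downloaded image will not open. I suspect that what I got is not the image at all but the program that generates it. Any pointers would be appreciated.

HttpClient client = new HttpClient();
GetMethod get = new GetMethod("http://xk.fudan.edu.cn/xk/img.do");
client.executeMethod(get);
File storeFile = new File("d:/sss.bmp");
FileOutputStream output = new FileOutputStream(storeFile);
// Get the network resource as a byte array and write it to the file
output.write(get.getResponseBody());
output.close();
get.releaseConnection();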

Solutions

Solution 2:
Yes, this can work. The code fetches the image's byte stream and writes it into an existing, empty image file.
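Since the response body is written to disk byte for byte, one way to test the asker's suspicion is to look at the first few bytes of the saved file. The sketch below is an illustration, not part of the original answers: the class name ImageSniffer and its methods are hypothetical, and it only recognises the well-known signatures of PNG, JPEG, GIF and BMP. If it prints something other than "bmp" for d:/sss.bmp, the file may simply have been saved under the wrong extension. If it prints "unknown", the server may have returned something else, such as an HTML error or login page.

```java
import java.nio.file.Files;
import java.nio.file.Paths;

// Hypothetical helper: guesses an image format from its leading "magic" bytes.
class ImageSniffer {

    /** Returns "png", "jpeg", "gif" or "bmp", or "unknown" if no signature matches. */
    static String detectFormat(byte[] data) {
        if (startsWith(data, 0x89, 'P', 'N', 'G', 0x0D, 0x0A, 0x1A, 0x0A)) return "png";
        if (startsWith(data, 0xFF, 0xD8, 0xFF)) return "jpeg";
        if (startsWith(data, 'G', 'I', 'F', '8')) return "gif";
        if (startsWith(data, 'B', 'M')) return "bmp";
        return "unknown";
    }

    // Compares the first bytes of data (as unsigned values) with the given prefix.
    private static boolean startsWith(byte[] data, int... prefix) {
        if (data == null || data.length < prefix.length) return false;
        for (int i = 0; i < prefix.length; i++) {
            if ((data[i] & 0xFF) != prefix[i]) return false;
        }
        return true;
    }

    public static void main(String[] args) throws Exception {
        if (args.length != 1) {
            System.err.println("usage: java ImageSniffer <path-to-saved-file>");
            return;
        }
        // Read the file that was saved earlier, e.g. d:/sss.bmp from the question.
        byte[] data = Files.readAllBytes(Paths.get(args[0]));
        System.out.println(detectFormat(data));
    }
}
```

The same check can be applied directly to the array returned by getResponseBody() before anything is written to disk.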
Solution 3:
package com.catchimage;

import java.io.File;
import java.io.FileOutputStream;
import java.io.IOException;

import javax.servlet.http.HttpServletRequest;

import org.apache.commons.httpclient.*;
import org.apache.commons.httpclient.methods.GetMethod;
import org.apache.commons.httpclient.params.HttpMethodParams;

public class CatchImage {
    private static HttpServletRequest req;

    public static HttpServletRequest getReq() {
        return req;
    }

    private static String rootAddress = "http://www.google.com.hk/intl/zh-CN/images/logo_cn.png";

    @SuppressWarnings("deprecation")
    public static void main(String[] args) {
        HttpClient httpClient = new HttpClient();
        // Connection and socket timeouts, in milliseconds
        httpClient.setConnectionTimeout(5000);
        httpClient.setTimeout(5000);
        GetMethod getMethod = new GetMethod(rootAddress);
        getMethod.getParams().setContentCharset("UTF-8");
        getMethod.getParams().setParameter(HttpMethodParams.RETRY_HANDLER, new DefaultHttpMethodRetryHandler());
        try {
            int statusCode = httpClient.executeMethod(getMethod);
            // Save the body only when the server answers 200 OK
            if (statusCode == HttpStatus.SC_OK) {
                File storeFile = new File("d:/google.png");
                FileOutputStream output = new FileOutputStream(storeFile);
                output.write(getMethod.getResponseBody());
                output.close();
            }
        } catch (HttpException e) {
            e.printStackTrace();
        } catch (IOException e1) {
            e1.printStackTrace();
        } finally {
            // Always release the connection
            getMethod.releaseConnection();
        }
    }
}

Requires commons-codec-1.4.jar, commons-httpclient-3.1.jar, and servlet-api-2.5.jar.
Solution 4:
I fetched the logo image from the Google home page and wrote it to a local file, google.png. Try running it.
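The answers above buffer the whole response with getResponseBody() before writing it out. For larger files, a streaming copy may be preferable. The sketch below illustrates that idea using only the JDK, with HttpURLConnection swapped in for commons-httpclient so that it runs without extra jars. The class name StreamingImageSaver and its methods are hypothetical. With commons-httpclient 3.1, the stream returned by getResponseBodyAsStream() could presumably be passed to save() in the same way.

```java
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.net.HttpURLConnection;
import java.net.URI;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardCopyOption;

// Hypothetical helper: streams an HTTP response body to a file.
class StreamingImageSaver {

    /** Copies the stream to target, replacing any existing file; returns the byte count. */
    static long save(InputStream body, Path target) throws IOException {
        try (InputStream in = body) { // the stream is closed even if the copy fails
            return Files.copy(in, target, StandardCopyOption.REPLACE_EXISTING);
        }
    }

    /** Downloads url to target; throws IOException unless the server answers 200 OK. */
    static long download(String url, Path target) throws IOException {
        HttpURLConnection conn = (HttpURLConnection) URI.create(url).toURL().openConnection();
        conn.setConnectTimeout(5000); // milliseconds, mirroring the timeouts used above
        conn.setReadTimeout(5000);
        try {
            int status = conn.getResponseCode();
            if (status != HttpURLConnection.HTTP_OK) {
                throw new IOException("Unexpected HTTP status: " + status);
            }
            return save(conn.getInputStream(), target);
        } finally {
            conn.disconnect();
        }
    }

    public static void main(String[] args) throws IOException {
        // Offline demonstration with an in-memory stream instead of a network call.
        Path tmp = Files.createTempFile("image", ".bin");
        long written = save(new ByteArrayInputStream(new byte[] {1, 2, 3}), tmp);
        System.out.println(written + " bytes written to " + tmp);
        Files.delete(tmp);
    }
}
```

A real call would look like StreamingImageSaver.download(url, Paths.get("d:/google.png")), where url is the image address.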

Time: 2025-01-25 12:31:52
