首页 \ 问答 \ 使用Python请求的不同响应(Different response using Python requests)

使用Python请求的不同响应(Different response using Python requests)

 我试图使用requests从URL下载图像。 使用浏览器或REST客户端，例如restlet chrome扩展名我可以检索可以保存到磁盘的正常内容，json和二进制映像。  
 使用requests作为响应结果我得到几乎相同的响应头，只有Content-Length具有不同的值 - 15字节而不是35千字节 - 并且我找不到二进制图像。  
 试图模拟浏览器发出的请求，我配置了相同的请求头，如下所示：  
headers = {"Host": "cpom.prefeitura.sp.gov.br",
           "Pragma": "no-cache",
           "Cache-Control": "no-cache",
           "DNT": "1",
           "Accept": "*/*",
           "Accept-Encoding": "gzip, deflate, br",
           "Accept-Language": "en-US,en;q=0.9,pt;q=0.8",
           "User-Agent": "Mozilla/5.0 (Windows NT 10.0; Win64; x64) "
                         "AppleWebKit/537.36 (KHTML, like Gecko) "
                         "Chrome/65.0.3325.181 Safari/537.36"
           }

r = requests.get(url, stream=True, headers=headers)
 
 没有重定向，我也调试并查看requests.model.Response的内容但没有成功。  
 我错过了什么？ 我认为这是一个关于请求的细节，但我无法得到它。  
 这个我的测试：  
url = "https://cpom.prefeitura.sp.gov.br/prestador/SituacaoCadastral/ImagemCaptcha?u=8762520"
r = requests.get(url, stream=True)

if r.status_code == 200:
    print(r.raw.headers)
    with open("/home/bruno/captcha/8762520.txt", "wb") as f:  # saving as text, since is not the png image
        for chunk in r:
            f.write(chunk)
 
 这是下载图片的网址： https ： //cpom.prefeitura.sp.gov.br/prestador/SituacaoCadastral/ImagemCaptcha？u = 4067913  
 这个网站带有验证码图片： https ： //cpom.prefeitura.sp.gov.br/prestador/SituacaoCadastral  
 用一个简单的GET只会得到一个json响应体，但检查响应后，您会看到二进制响应，即图像大小〜36kb。  
 编辑 ：包括来自restlet客户端的图像  
 请求：   
 响应：  

I'm trying to download an image from a URL using requests. Using browser or a REST client, like restlet chrome extension I can retrieve the normal content, a json, and a binary image that I can save to disk. 
Using requests as response result I got almost same response headers, only Content-Length has a different value - 15 bytes instead of 35 kilobytes - and I can't found the binary image. 
Trying to simulate the request made by the browser I configure the same request header, like this: 
headers = {"Host": "cpom.prefeitura.sp.gov.br",
           "Pragma": "no-cache",
           "Cache-Control": "no-cache",
           "DNT": "1",
           "Accept": "*/*",
           "Accept-Encoding": "gzip, deflate, br",
           "Accept-Language": "en-US,en;q=0.9,pt;q=0.8",
           "User-Agent": "Mozilla/5.0 (Windows NT 10.0; Win64; x64) "
                         "AppleWebKit/537.36 (KHTML, like Gecko) "
                         "Chrome/65.0.3325.181 Safari/537.36"
           }

r = requests.get(url, stream=True, headers=headers)
 
There's no redirects, I also debug and look the content of requests.model.Response but no success. 
What I'm missing? I think that is a detail about the request, but I can't get it. 
This my test: 
url = "https://cpom.prefeitura.sp.gov.br/prestador/SituacaoCadastral/ImagemCaptcha?u=8762520"
r = requests.get(url, stream=True)

if r.status_code == 200:
    print(r.raw.headers)
    with open("/home/bruno/captcha/8762520.txt", "wb") as f:  # saving as text, since is not the png image
        for chunk in r:
            f.write(chunk)
 
This is the URL to download the image: https://cpom.prefeitura.sp.gov.br/prestador/SituacaoCadastral/ImagemCaptcha?u=4067913 
And this the site with the captcha image: https://cpom.prefeitura.sp.gov.br/prestador/SituacaoCadastral 
With a simple GET will get only a json response body, but inspecting the response you'll see the binary response, which is the image - ~36kb size. 
EDIT: include images from restlet client 
Request:  
Response: 

原文：https://stackoverflow.com/questions/49940970

更新时间：2023-04-22 08:04

最满意答案

 我的猜测是正确的，这个代码工作 - 电子表格插入«一对一»并可以应用不同的风格：  
                <script>
                $(document).ready(function() {
                    $('#result').load('-google spreadsheet html import link- #tblMain');
                });         


                </script>
 
 比我想象的要容易得多:) 

My guess was right, this code working — spreadsheet inserting «one in one» and can apply different style: 
                <script>
                $(document).ready(function() {
                    $('#result').load('-google spreadsheet html import link- #tblMain');
                });         


                </script>
 
All much easier than I expected:)

使用Python请求的不同响应(Different response using Python requests)

最满意答案

相关问答

如何从google电子表格中的多个单元格中将文本从不同的
标记中拉到每个网站？(How do I pull text from multiple cells in a google spreadsheet to website each in a different
tag?)[2022-04-22]

样式不适用于ASP Classic网站的元素(Styles are not applying to elements of ASP Classic site)[2022-05-18]

用于电子表格的Google脚本，用于从工作表上链接的PDF页面中提取数据(Google Script for spreadsheet to extract data from a PDF page linked on the sheet)[2022-05-12]

如何使用Google Script更新Google网站中嵌入的图表(How to update chart embedded in Google Site using Google Script)[2023-12-18]

将Google电子表格导入网站页面并将网站样式应用于表格标记(import google spreadsheet to site page and apply site styles to a table tag)[2023-05-27]

使用Python请求的不同响应(Different response using Python requests)

最满意答案

相关问答

如何从google电子表格中的多个单元格中将文本从不同的标记中拉到每个网站？(How do I pull text from multiple cells in a google spreadsheet to website each in a different tag?)[2022-04-22]

样式不适用于ASP Classic网站的元素(Styles are not applying to elements of ASP Classic site)[2022-05-18]

用于电子表格的Google脚本，用于从工作表上链接的PDF页面中提取数据(Google Script for spreadsheet to extract data from a PDF page linked on the sheet)[2022-05-12]

如何使用Google Script更新Google网站中嵌入的图表(How to update chart embedded in Google Site using Google Script)[2023-12-18]

将Google电子表格导入网站页面并将网站样式应用于表格标记(import google spreadsheet to site page and apply site styles to a table tag)[2023-05-27]

如何从google电子表格中的多个单元格中将文本从不同的
标记中拉到每个网站？(How do I pull text from multiple cells in a google spreadsheet to website each in a different
tag?)[2022-04-22]