商標檢索網站,中國商標網 -爬蟲
商標檢索網站,中國商標網 -爬蟲
? ? ? ? 最近有時間,找了一些比較麻煩的網站來練手,然后想起來 以前說要弄商標網的,今天就又上去看了下!
? ? ? ? 以前轉載的鏈接 :商標局網請收下我的膝蓋
商標檢索網站。? ? ? ? 上去查看了下,感覺怎么參數這么明顯了!!!???? ? ? 應該是取消了很多爬蟲限制!
? ? ? ? 然后模擬請求的試了下,請求成功,成功獲取到數值!?
?使用的接口是:
商標搜索平臺?http://sbgg.saic.gov.cn:9080/tmann/annInfoView/selectInfoidBycode.html?
http://sbgg.saic.gov.cn:9080/tmann/annInfoView/imageView.html? http://sbgg.saic.gov.cn:9080/tmann/annInfoView/annSearchDG.html
組合起來 能根據不同的 條件進行查詢,并下載最終的圖片,有一點需要注意的是 返回的是圖片鏈接列表 ,我們需要的是 下標為3的那個?
中國商標網商標查詢標庫網?簡單代碼如下(僅做學習參考):
import requests, re, json, time, randomwith open("搜索結果1.json", "r", encoding="utf-8") as f:data = f.read()def run(ann_num, page_no, ann_type_code):url = "http://sbgg.saic.gov.cn:9080/tmann/annInfoView/selectInfoidBycode.html"headers = {"Accept": "application/json, text/javascript, */*; q=0.01","Accept-Encoding": "gzip, deflate","Accept-Language": "zh-CN,zh;q=0.9,en;q=0.8","Connection": "keep-alive","Content-Type": "application/x-www-form-urlencoded; charset=UTF-8","Cookie": "",# cookie"Host": "sbgg.saic.gov.cn:9080","Origin": "http://sbgg.saic.gov.cn:9080","Referer": "http://sbgg.saic.gov.cn:9080/tmann/annInfoView/annSearch.html?annNum=","User-Agent": "Mozilla/5.0 (Windows NT 6.1; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/67.0.3396.99 Safari/537.36","X-Requested-With": "XMLHttpRequest",}data = {"annNum": ann_num,"annTypecode": ann_type_code,}response = requests.post(url=url, headers=headers, data=data, timeout=15)id = response.textprint(id)URL2 = "http://sbgg.saic.gov.cn:9080/tmann/annInfoView/imageView.html"data2 = {"id": id,"pageNum": page_no,"flag": "1",}response2 = requests.post(url=URL2, headers=headers, data=data2, timeout=15)data = response2.textdata = eval(data)image = data["imaglist"][3]print(image)if __name__ == '__main__':"""代碼僅做學習參考"""data_dict = eval(data)total = data_dict["total"] # 商標總數rows = data_dict["rows"] # 商標總數print(total)for i in rows:page_no = i["page_no"] # 頁數編號tm_name = i["tm_name"] # 商標名稱ann_type_code = i["ann_type_code"] # 請求參數tmname = i["tmname"] # 商標名稱reg_name = i["reg_name"] # 公司名稱ann_type = i["ann_type"] # 公告還是省定ann_num = i["ann_num"] # 公告期數reg_num = i["reg_num"] # 商標idid = i["id"] # 請求idrn = i["rn"] # 位置app_date = i["ann_date"] # 申請日期regname = i["regname"] # # 申請人名稱???if ann_type == "商標初步審定公告":run(ann_num, page_no, ann_type_code)time.sleep(5)
?