
In web crawlers and automated tasks, sending frequent requests from the same IP address can quickly get that address blocked by the target website. This article explains how to implement proxy and IP rotation in Python 3 using three mainstream solutions, with detailed code and a guide to common pitfalls.
The first solution rotates proxies directly with the requests library: define a proxy pool, then pick a random proxy for each request.

import requests
import random
from time import sleep

proxies_pool = [
    {"http": "http://123.45.67.89:8080", "https": "http://123.45.67.89:8080"},
    {"http": "http://112.233.44.55:3128", "https": "http://112.233.44.55:3128"},
    # Expandable with more proxies...
]
def rotate_proxy_request(url, max_retries=5):
    """Fetch a URL, switching to a random proxy on each attempt."""
    for attempt in range(max_retries):
        proxy = random.choice(proxies_pool)
        try:
            response = requests.get(
                url,
                proxies=proxy,
                timeout=10,
                headers={"User-Agent": "Mozilla/5.0"}
            )
            if response.status_code == 200:
                return response.text
            print(f"proxy {proxy} returned status {response.status_code}")
        except requests.RequestException as e:
            print(f"proxy {proxy} failed: {e}")
        sleep(2)  # delayed retry with another proxy after a failure
    raise RuntimeError(f"all {max_retries} attempts failed for {url}")
# Usage example
data = rotate_proxy_request("https://target-website.com/data")
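If the site sets cookies, rotating the proxy on every request can break login state, which is why the comparison table at the end of this article lists self-managed sessions as a drawback of this approach. Below is a minimal sketch of binding a requests.Session to one proxy from the pool; the helper name make_proxy_session is illustrative, not part of any library.

def make_proxy_session(proxy):
    """Create a Session that sends every request through the given proxy."""
    session = requests.Session()
    session.proxies.update(proxy)  # applies to all requests made on this session
    session.headers["User-Agent"] = "Mozilla/5.0"
    return session

# All calls on this session share cookies and exit through the same proxy
session = make_proxy_session(random.choice(proxies_pool))
resp = session.get("https://target-website.com/data", timeout=10)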
The second solution plugs proxy rotation into Scrapy through a custom downloader middleware.

# middlewares.py
import random

# proxies_pool is the same list of proxy dicts shown above (define or import it here)

class ProxyMiddleware:
    def process_request(self, request, spider):
        proxy = random.choice(proxies_pool)
        request.meta['proxy'] = proxy['http']
        # Add when the proxy requires authentication
        # (needs: from w3lib.http import basic_auth_header)
        # request.headers['Proxy-Authorization'] = basic_auth_header('user', 'pass')
# settings.py: register the middleware
DOWNLOADER_MIDDLEWARES = {
    'myproject.middlewares.ProxyMiddleware': 543,
}
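Hard-coding the pool inside middlewares.py works for a demo, but in a larger project it is more idiomatic to keep it in settings.py and let Scrapy inject it. A minimal sketch, assuming a custom PROXIES_POOL setting (the setting name and class name are illustrative, not built into Scrapy) holding plain proxy URLs:

# middlewares.py: variant that reads the pool from Scrapy settings
import random

class SettingsProxyMiddleware:
    def __init__(self, proxies):
        self.proxies = proxies

    @classmethod
    def from_crawler(cls, crawler):
        # getlist returns an empty list if the setting is missing
        return cls(crawler.settings.getlist("PROXIES_POOL"))

    def process_request(self, request, spider):
        if self.proxies:
            request.meta["proxy"] = random.choice(self.proxies)

# settings.py
# PROXIES_POOL = ["http://123.45.67.89:8080", "http://112.233.44.55:3128"]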
The third solution drives a real Chrome browser with Selenium, which is useful when the target page requires JavaScript rendering.

from selenium import webdriver
from selenium.webdriver.chrome.options import Options
def get_chrome_with_proxy(proxy):
    """Start Chrome routed through the given proxy ("host:port" string)."""
    chrome_options = Options()
    chrome_options.add_argument(f'--proxy-server={proxy}')
    driver = webdriver.Chrome(options=chrome_options)
    return driver
# Usage example
driver = get_chrome_with_proxy("123.45.67.89:8080")
driver.get("https://target-site.com")
# ...scrape the rendered page, then release the browser
driver.quit()
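Chrome binds the proxy at launch, so rotating with Selenium means starting a fresh driver per proxy. A minimal sketch that retries a page through different proxies, reusing get_chrome_with_proxy from above (the selenium_proxies list, fetch_with_rotation helper, and URL are illustrative):

import random

# Plain "host:port" strings for the --proxy-server flag (illustrative values)
selenium_proxies = ["123.45.67.89:8080", "112.233.44.55:3128"]

def fetch_with_rotation(url, attempts=3):
    """Try the URL through different proxies and return the rendered page source."""
    for _ in range(attempts):
        driver = get_chrome_with_proxy(random.choice(selenium_proxies))
        try:
            driver.get(url)
            return driver.page_source
        except Exception as exc:
            print(f"proxy attempt failed: {exc}")
        finally:
            driver.quit()  # always release the browser before rotating
    return None

html = fetch_with_rotation("https://target-site.com")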
Whichever solution you choose, dead proxies should be weeded out with a health check. The helper below asks httpbin.org for the IP it sees and compares it with the proxy address:

def validate_proxy(proxy):
    """Return True if the proxy responds and actually masks our IP."""
    try:
        test = requests.get(
            "http://httpbin.org/ip",
            proxies=proxy,
            timeout=5
        )
        # httpbin echoes the caller's IP; it should match the proxy host
        return test.json()['origin'] in proxy['http']
    except requests.RequestException:
        return False
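A simple way to keep the pool healthy, sketched below, is to re-run this check periodically and keep only the proxies that pass (the refresh_pool name is illustrative):

def refresh_pool(pool):
    """Return only the proxies that currently pass the health check."""
    healthy = [p for p in pool if validate_proxy(p)]
    print(f"{len(healthy)}/{len(pool)} proxies are healthy")
    return healthy

proxies_pool = refresh_pool(proxies_pool)  # e.g. run before each crawl session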
| Solution | Applicable scenario | Advantages | Disadvantages |
| --- | --- | --- | --- |
| Requests rotation | Simple crawlers | Fast to implement | Sessions must be managed manually |
| Scrapy middleware | Large-scale distributed crawlers | Good framework integration | Higher learning curve |
| Selenium automation | JavaScript-rendered pages | Simulates a real browser | High resource consumption |
With the solutions above, developers can choose the proxy rotation strategy that fits their needs. In production, prefer paid proxy services and combine them with the health check shown earlier to keep the proxy pool reliable. Also set reasonable request intervals and crawl responsibly, respecting the target site's terms of service.
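For the request interval, a small randomized pause between requests is usually enough to avoid hammering the target; a minimal sketch (the 1 to 3 second bounds are arbitrary):

import random
import time

time.sleep(random.uniform(1.0, 3.0))  # wait 1-3 seconds between consecutive requests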