PyPAC

v0.18.3 suspicious
4.0
Medium Risk

Proxy auto-config and auto-discovery for Python.

🤖 AI Analysis

Final verdict: SUSPICIOUS

The package shows low risks in terms of network, shell, and obfuscation activities, but it contains a non-secure external link and is maintained by a new maintainer with limited PyPI experience, which raises some concerns about its integrity.

  • Non-secure external link
  • New maintainer with only one PyPI package
Per-check LLM notes
  • Network: No network calls detected, which is normal if the package does not require external communication.
  • Shell: No shell execution patterns detected, indicating no suspicious command-line activity.
  • Obfuscation: No obfuscation patterns detected, indicating low risk.
  • Credentials: No credential harvesting patterns detected, indicating low risk.
  • Metadata: The package has a non-secure external link and a new maintainer with only one package on PyPI, which could indicate potential risk.

🔬 Heuristic Checks

Outbound Network Calls

No suspicious network call patterns found

Code Obfuscation

No obfuscation patterns detected

Shell / Subprocess Execution

No shell execution patterns detected

Credential Harvesting

No credential harvesting patterns detected

Typosquatting

No typosquatting candidates detected

Registered Email Domain

Email domain looks legitimate: users.noreply.github.com

Suspicious Page Links score 2.0

Found 1 suspicious link(s) on the package page

  • Non-HTTPS external link: http://example.org
Git Repository History

Repository carsonyl/pypac appears legitimate

Maintainer History score 2.0

1 maintainer concern(s) found

  • Author "Carson Lam" appears to have only 1 package on PyPI (new or inactive account)
Known CVE Vulnerabilities

No known vulnerabilities found in OSV database.

💡 AI App Starter Prompt

Use this prompt to build a project with PyPAC
Create a simple web scraper tool that intelligently navigates through multiple pages of a website while handling dynamic content loading via JavaScript. This tool should be able to automatically detect and use any available proxy servers through PAC (Proxy Auto-Configuration) files to bypass potential IP blocking mechanisms employed by the target site. Utilize the Python package 'PyPAC' to manage the proxy configuration and discovery process.

Key Features:
1. Ability to specify a URL to scrape, including support for pagination.
2. Use Selenium for rendering JavaScript-heavy pages.
3. Implement PyPAC to dynamically select and configure proxies based on the PAC file provided.
4. Support for custom user agents to mimic different browsers.
5. Output scraped data into a structured format like CSV or JSON.
6. Include error handling for common issues such as network timeouts and HTTP errors.
7. Provide a command-line interface for easy interaction.

Steps to Build:
1. Set up a Python virtual environment and install necessary packages including PyPAC, Selenium, and Pandas.
2. Develop a function to load and parse the PAC file using PyPAC.
3. Create a class for the web scraper that integrates with Selenium and PyPAC for dynamic proxy selection.
4. Implement functions to navigate through web pages and extract desired data.
5. Add functionality to handle different types of pagination methods (e.g., infinite scroll, next page buttons).
6. Design the output mechanism to save scraped data in CSV/JSON format.
7. Write a main script to tie everything together, providing a CLI for users to input URLs and other parameters.
8. Test the application thoroughly with various websites and scenarios.
9. Document the code and include instructions for setup and usage.