at-gan

v0.13.9 safe
4.0
Medium Risk

Training Framework for Arbitrary Tabular Generative Adversarial Networks

πŸ€– AI Analysis

Final verdict: SAFE

The package shows low risk indicators with no network calls, shell executions, or credential harvesting attempts. However, the metadata risk due to sparse author information slightly elevates the score.

  • Low network and shell execution risks
  • Sparse author information
Per-check LLM notes
  • Network: No network calls detected, which is normal if the package does not require internet access.
  • Shell: No shell execution patterns detected, indicating no immediate signs of executing system commands.
  • Obfuscation: No obfuscation patterns detected, indicating low risk.
  • Credentials: No credential harvesting patterns detected, indicating low risk.
  • Metadata: The author's information is sparse, which may indicate a lack of transparency or a new/inactive maintainer.

πŸ“¦ Package Quality Overall: Medium (5.2/10)

β—ˆ Medium Test Suite 6.0

Partial test coverage signals detected

  • 1 test file(s) detected (e.g. test_package.py)
β—ˆ Medium Documentation 7.0

Some documentation present

  • Documentation URL: "Documentation" -> https://github.com/Jns-M/at-gan#readme
  • Detailed PyPI description (19215 chars)
β—‹ Low Contributing Guide 2.0

No contributing guide or governance files found

  • No CONTRIBUTING, CODE_OF_CONDUCT, or governance files found
β—ˆ Medium Type Annotations 5.0

Partial type annotation coverage

  • 97 type-annotated function signatures detected in source
β—ˆ Medium Multiple Contributors 6.0

Limited contributor diversity

  • 2 unique contributor(s) across 44 commits in Jns-M/at-gan
  • Two distinct contributors found

πŸ”¬ Heuristic Checks

βœ“ Outbound Network Calls

No suspicious network call patterns found

βœ“ Code Obfuscation

No obfuscation patterns detected

βœ“ Shell / Subprocess Execution

No shell execution patterns detected

βœ“ Credential Harvesting

No credential harvesting patterns detected

βœ“ Typosquatting

No typosquatting candidates detected

βœ“ Registered Email Domain

Email domain looks legitimate: miesenboeck.at>

βœ“ Suspicious Page Links

All external links appear legitimate

βœ“ Git Repository History

Repository Jns-M/at-gan appears legitimate

⚠ Maintainer History score 4.0

2 maintainer concern(s) found

  • Author name is missing or very short
  • Author "" appears to have only 1 package on PyPI (new or inactive account)
βœ“ Known CVE Vulnerabilities

No known vulnerabilities found in OSV database.

πŸ’‘ AI App Starter Prompt

Use this prompt to build a project with at-gan
Create a data augmentation tool using the 'at-gan' package for generating synthetic tabular datasets. This tool will be particularly useful for researchers and data scientists who need to augment their datasets without compromising privacy or introducing biases from real-world data sources. The application should have the following functionalities:

1. **Data Input**: Users should be able to upload a CSV file containing their original dataset. The tool should handle basic preprocessing such as handling missing values, encoding categorical variables, and normalizing numerical columns.

2. **Model Training**: Using the 'at-gan' framework, train a generative adversarial network (GAN) on the uploaded dataset. The model should learn the underlying distribution of the input data and generate new samples that are statistically similar to the training set.

3. **Synthetic Data Generation**: After training, allow users to specify the number of synthetic rows they wish to generate. The application should then use the trained GAN to produce these synthetic samples.

4. **Evaluation Tools**: Implement basic evaluation metrics to assess the quality and diversity of the generated data compared to the original dataset. Metrics could include statistical tests like Kolmogorov-Smirnov test for numerical distributions and chi-square tests for categorical distributions.

5. **Export Functionality**: Provide an option for users to export the generated synthetic data as a CSV file for further analysis or integration into other applications.

To utilize the 'at-gan' package, you'll need to leverage its capabilities for training GANs on arbitrary tabular data. Specifically, focus on how 'at-gan' handles complex tabular structures, including mixed data types (categorical and numerical), and ensures that the synthetic data maintains the integrity of the original dataset's structure and distribution.

πŸ’¬ Discussion Feed

Leave a comment

No discussion yet. Be the first to share your thoughts!