⛏️pypi_data_harvest.py

This tool handles scraping package data from PyPI

pypi_data_harvest.py requires an API key from libraries.io

Description

pypi_data_harvest.py gathers package info from libraries.io and scrapes data from pypi.org for additional metadata.

By default a new data set CSV file is created, however most of the time you will use the --update flag for updating an existing data set CSV file.

New Python packages are uploaded every day, therefore you will want to update before use.

pypi_data_harvest.py requires the following dependencies:

Parameter --update, -u

Parameter --apikey, -k

Parameter --verbose, -v

py pypi_data_harvest.py

py pypi_data_harvest.py --update "pypi_info_db.csv"

py pypi_data_harvest.py -u "pypi_info_db.csv" -k "C:\\apikey.txt" -v

pypi_data_harvest.py requires an API key from libraries.io

Last updated 2 years ago