Getting set up
Steps to install CSAPP and start the web app
Step 2: Install the dependencies
pip install streamlit duckdb csapptools beautifulsoup4 requests argparse
Step 3: Download the full PyPI data set
https://drive.google.com/file/d/176E-5gb97egmRe8W9K-bh2YqHPn1W0PE/view?usp=sharing
Due to the file size being about 160 MB, I have to host the file on Google Drive.
Store the data set CSV file in the CSAPP directory.
Step 4: Update the data set
pypi_data_harvest.py requires an API key from libraries.io
Create an account on libraries.io
Create a hidden directory in your user's home folder called
.librariesio
Saved libraries.io api key in a txt file called
api_key.txt
py pypi_data_harvest.py --update "pypi_info_db.csv"
New PyPI packages are uploaded every day, therefore you will want to update the data set before use.
Last updated