add simple script
This commit is contained in:
24
README.md
Normal file
24
README.md
Normal file
@@ -0,0 +1,24 @@
|
|||||||
|
### How to use
|
||||||
|
|
||||||
|
0. Install dependencies
|
||||||
|
```
|
||||||
|
pip install pandas
|
||||||
|
```
|
||||||
|
|
||||||
|
1. Fet the full path of the parquet file you want to parse.
|
||||||
|
```
|
||||||
|
C:\somedir\file.parquet # On Windows
|
||||||
|
/somedir/file.parquet # On Mac/Linux
|
||||||
|
```
|
||||||
|
|
||||||
|
2. Edit the parse_parquet.py with the full path of your parquet file taken from 1.
|
||||||
|
|
||||||
|
3. Run the script:
|
||||||
|
```
|
||||||
|
python3 parse_parquet.py
|
||||||
|
```
|
||||||
|
|
||||||
|
### TBD..
|
||||||
|
|
||||||
|
- Make the script prompt for the path when executed rather than having it hardcoded
|
||||||
|
- Potentially add a GUI so users can run it as an app
|
||||||
10
parse_parquet.py
Normal file
10
parse_parquet.py
Normal file
@@ -0,0 +1,10 @@
|
|||||||
|
import pandas as pd
|
||||||
|
pd.set_option('display.max_rows', None)
|
||||||
|
pd.set_option('display.max_columns', None)
|
||||||
|
pd.set_option('display.width', None)
|
||||||
|
pd.set_option('display.max_colwidth', None)
|
||||||
|
|
||||||
|
parquet_file=input('Enter the absolute path for your parquet file')
|
||||||
|
|
||||||
|
df=pd.read_parquet(parquet_file)
|
||||||
|
print(df)
|
||||||
Reference in New Issue
Block a user