Python formatting, and gitignore additions. #326
@ianmilligan1 I have all these steps saved locally, so we can use them for documentation when the time comes.
codecov-io commented Jul 17, 2019
Codecov Report
@@           Coverage Diff           @@
##           master     #326   +/-   ##
=======================================
  Coverage   74.97%   74.97%
=======================================
  Files          39       39
  Lines        1123     1123
  Branches      197      197
=======================================
  Hits          842      842
  Misses        215      215
  Partials       66       66
Continue to review full report at Codecov.
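As a quick sanity check on the report above, the figures can be reproduced by hand. The sketch below is only arithmetic on the numbers shown in the table. It assumes the coverage figure is hits divided by lines, and it observes that the reported 74.97% matches that ratio truncated (not rounded) to two decimals. That is an observation about these particular numbers, not a statement of how Codecov computes or rounds its output.

```python
import math

# Figures copied from the Codecov report for this pull request.
hits, misses, partials = 842, 215, 66
lines = 1123


def coverage_percent(hit_count, line_count):
    """Coverage as a percentage, assuming coverage = hits / lines."""
    return 100.0 * hit_count / line_count


# Every line is either a hit, a miss, or a partial.
assert hits + misses + partials == lines

ratio = coverage_percent(hits, lines)
# Truncating to two decimals reproduces the figure shown in the report.
truncated = math.floor(ratio * 100) / 100
print(f"exact: {ratio:.4f}%  truncated: {truncated}%")
```

Since both the master and #326 columns are identical, the pull request leaves coverage unchanged, which is what one would expect from a formatting-only change.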
Woohoo. Very great stuff. Lots of politician faces in this sample web archive of Canadian political parties. Apologies for the delay on this: after a few dozen Slack messages and some wrangling, this was successfully built in a conda virtual environment (this guide was useful for future reference). For documentation purposes, on macOS, the default URL for Spark master was formatted as
Oh, that's good to know about the Mac side of things.
ruebot commented Jul 8, 2019
What does this Pull Request do?
Follow-on to 7a61f0e
black and isort on Python files.
How should this be tested?
I tested locally, and it was good to go. @ianmilligan1 if you want to test on your end, grab a small WARC (990/8471 is perfect!), then:
conda install pyspark
conda install tensorflow
conda install pyarrow
export PYSPARK_PYTHON=/home/ruestn/anaconda3/bin/python
export PYSPARK_DRIVER_PYTHON=/home/ruestn/anaconda3/bin/python
Build the branch locally
Pull down the models:
cd /tmp && wget http://download.tensorflow.org/models/object_detection/ssd_mobilenet_v1_fpn_shared_box_predictor_640x640_coco14_sync_2018_07_03.tar.gz
tar -xzvf ssd_mobilenet_v1_fpn_shared_box_predictor_640x640_coco14_sync_2018_07_03.tar.gz
mkdir -p /PATH/TO/aut/src/main/python/tf/model/graph/ssd_mobilenet_v1_fpn_640x640/
cp /tmp/ssd_mobilenet_v1_fpn_shared_box_predictor_640x640_coco14_sync_2018_07_03/frozen_inference_graph.pb /PATH/TO/aut/src/main/python/tf/model/graph/ssd_mobilenet_v1_fpn_640x640/
mkdir -p /PATH/TO/aut/src/main/python/tf/model/category/
cd /PATH/TO/aut/src/main/python/tf/model/category/
wget https://raw.githubusercontent.com/tensorflow/models/master/research/object_detection/data/mscoco_label_map.pbtxt
cp /PATH/TO/aut/src/main/python/tf/util/spark.conf.example /PATH/TO/aut/src/main/python/tf/util/spark.conf
/PATH/TO/SPARK/sbin/start-master.sh
/PATH/TO/SPARK/sbin/start-slave.sh 127.0.1.1:7077
python /PATH/TO/aut/src/main/python/tf/detect.py --web_archive "/home/nruest/tmp/auk/990/8471/warcs/*" --aut_jar /home/nruest/Projects/au/aut/target/aut-0.17.1-SNAPSHOT-fatjar.jar --spark /home/nruest/bin/spark-2.4.1-bin-hadoop2.7/bin --master spark://127.0.1.1:7077 --img_model ssd --filter_size 50 50 --output_path /home/nruest/Projects/au/sample-data/aut-image-tf-testing-03
python /PATH/TO/aut/src/main/python/tf/extract_images.py --res_dir /home/nruest/Projects/au/sample-data/aut-image-tf-testing-03 --output_dir /home/nruest/Projects/au/sample-data/aut-image-tf-testing-image-output-03 --threshold 0.85
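For anyone repeating these steps on a different machine, the commands above could be assembled from a few variables rather than edited by hand. The following is a minimal sketch, not part of this pull request. The function names (`missing_packages`, `build_detect_command`, `build_extract_command`) are hypothetical helpers; the script locations and flags are copied from the commands above, and the default values (`ssd`, `50 50`, `0.85`) are simply the ones used in this test run.

```python
import importlib.util
import shlex

# Packages installed with conda in the steps above.
REQUIRED_PACKAGES = ("pyspark", "tensorflow", "pyarrow")


def missing_packages(names=REQUIRED_PACKAGES):
    """Return the top-level packages that this interpreter cannot find."""
    return [name for name in names if importlib.util.find_spec(name) is None]


def _script(aut_root, name):
    # Script location inside the aut checkout, as used in the steps above.
    return f"{aut_root.rstrip('/')}/src/main/python/tf/{name}"


def build_detect_command(aut_root, web_archive, aut_jar, spark_bin, master,
                         output_path, img_model="ssd", filter_size=(50, 50),
                         python="python"):
    """Argument list for detect.py, using only the flags shown above."""
    width, height = filter_size
    return [
        python, _script(aut_root, "detect.py"),
        "--web_archive", web_archive,  # the glob is passed through unexpanded
        "--aut_jar", aut_jar,
        "--spark", spark_bin,
        "--master", master,
        "--img_model", img_model,
        "--filter_size", str(width), str(height),
        "--output_path", output_path,
    ]


def build_extract_command(aut_root, res_dir, output_dir, threshold=0.85,
                          python="python"):
    """Argument list for extract_images.py, using only the flags shown above."""
    return [
        python, _script(aut_root, "extract_images.py"),
        "--res_dir", res_dir,
        "--output_dir", output_dir,
        "--threshold", str(threshold),
    ]


if __name__ == "__main__":
    print("missing packages:", missing_packages())
    detect = build_detect_command(
        "/PATH/TO/aut", "/PATH/TO/warcs/*", "/PATH/TO/aut-fatjar.jar",
        "/PATH/TO/SPARK/bin", "spark://127.0.1.1:7077", "/PATH/TO/results")
    # Quote each argument so the printed line can be pasted into a shell.
    print(" ".join(shlex.quote(arg) for arg in detect))
```

The lists returned here can be passed directly to `subprocess.run`, which avoids shell quoting issues with the `*` in the WARC path. The `/PATH/TO/...` values in the usage section are placeholders, as in the steps above.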