-
Notifications
You must be signed in to change notification settings - Fork 0
/
Readme.txt
28 lines (18 loc) · 1.48 KB
/
Readme.txt
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
Name: Rajat Patel & Mohit Khatwani
Project: Combining Image Recognition with Knowledge Graph Embedding for Learning Semantic Attribute of Images
email ID: [email protected] & [email protected]
baseline_model_1 : The baseline model architecture implemented, the file is self sufficient to run, only changes required are file location there mentioned as global parameter
baseline_model_2: Proposed solution model architecture implemented, the file is self sufficient to run, only changes required are file location there mentioned as global parameter
VGG_encoder: The program gives encoded representation of the images and saves them as a numpy array
reading_file_loc: This file does the data preprocessing required for the model architectures and saves the required dataframe.
Running the object recognition model
yolo_recog.py: Main file to run the pretrained model, for object detection.
yolov3.py: Definition of YOLO model
yolov3.weights: weights of trained model on COCO dataset
convert_weights.py & convert_weights_pb.py: Convert weights file to protobuffer file
pickle_files:
label_to_imagedict.pickle : Dictionary with key: labels, values: list of image information from pycoco api
filename_imagedict.pickle : Dictionary with key: labels, values: file_names
label_to_object_final.pickle: Dictionary with key: labels, values: list of object detected in an image
encoded_image_vec.npz: encoded image vectors
train_vec_file: dataframe of word embedding and class labels