
Video-Lecture-Slide-Mapper

See project.pdf for problem details.

Libraries Imported :-

os, random, cv2, sys, threading, numpy, skimage.measure, PIL, pytesseract, difflib, Levenshtein

How To Run :-

python3 code.py /path_to_slides_dir/ /path_to_frames_dir/

Algorithms Used :-
Normalised Correlation
Scale-Invariant Feature Transform (SIFT)
Optical Character Recognition (OCR)

Logic and Workflow :-

We start by reading all the files in the frames directory, which is passed as the second argument when executing the script. For each frame read, we first compute the normalised correlation of the frame with every slide. The normalised correlation indicates how well parts of the frame line up with the corresponding slide. Since different slides can produce similar normalised correlation values, this alone is not decisive and needs further refinement.

To refine the result, we select the top 6 slides with the highest correlation and run the Scale-Invariant Feature Transform (SIFT) on the frame against each of these 6 slides. SIFT outputs the number of matching keypoints between the frame and each candidate slide, which lets us pick the best of the 6. Normalised correlation ensures that completely dissimilar slides are eliminated from the comparison, while SIFT looks for the closest match by judging the number of matching points.

If the match counts of the two most similar slides are close (a difference of fewer than 8 points in the code), there is a high probability that SIFT predicts the wrong slide, since two different slides produce nearly the same score. In that case we apply the OCR algorithm to both of these slides and to the selected frame, and the final result is decided by comparing the OCR outputs; the answer is then written to the corresponding output.txt file.

Going through multiple phases, each of which reduces the probability of error considerably, keeps mistakes low. Using this multi-phase comparison, conditioned on the outcome of each stage, we achieve quite high accuracy. On the given sample_tests, as well as on a dataset we built by photographing our AI course slides alongside the uploaded slides, we achieved an accuracy of about 92-93%.
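The sketch below outlines this three-phase decision for a single frame. It is not the exact code from code.py: the function names (normalised_correlation, sift_match_count, ocr_text, map_frame_to_slide), the slides dictionary, and the grayscale numpy-array inputs are assumptions for illustration; it assumes at least two candidate slides and an OpenCV build recent enough to provide cv2.SIFT_create().

```python
import cv2
import numpy as np
import pytesseract
import Levenshtein
from PIL import Image


def normalised_correlation(a, b):
    # Zero-mean, unit-variance correlation between two equally sized grayscale images.
    a = (a - a.mean()) / (a.std() + 1e-8)
    b = (b - b.mean()) / (b.std() + 1e-8)
    return float((a * b).mean())


def sift_match_count(img1, img2):
    # Number of SIFT keypoint matches that survive Lowe's ratio test.
    sift = cv2.SIFT_create()  # cv2.xfeatures2d.SIFT_create() on older OpenCV builds
    _, des1 = sift.detectAndCompute(img1, None)
    _, des2 = sift.detectAndCompute(img2, None)
    if des1 is None or des2 is None:
        return 0
    matches = cv2.BFMatcher().knnMatch(des1, des2, k=2)
    return sum(1 for pair in matches
               if len(pair) == 2 and pair[0].distance < 0.75 * pair[1].distance)


def ocr_text(img):
    # Extract whatever text Tesseract can read from the image.
    return pytesseract.image_to_string(Image.fromarray(img))


def map_frame_to_slide(frame, slides, top_k=6, sift_margin=8):
    """frame: grayscale numpy array; slides: dict mapping slide name -> grayscale array."""
    # Phase 1: normalised correlation against every slide, keep the best top_k.
    scores = sorted(
        ((normalised_correlation(frame,
                                 cv2.resize(s, (frame.shape[1], frame.shape[0]))), name)
         for name, s in slides.items()),
        reverse=True)
    candidates = [name for _, name in scores[:top_k]]

    # Phase 2: SIFT keypoint matching on the shortlisted slides.
    sift_scores = sorted(((sift_match_count(frame, slides[n]), n) for n in candidates),
                         reverse=True)
    (best_count, best), (second_count, second) = sift_scores[0], sift_scores[1]

    # Phase 3: OCR tie-break when the two best SIFT counts are within the margin.
    if best_count - second_count < sift_margin:
        frame_text = ocr_text(frame)
        if (Levenshtein.ratio(frame_text, ocr_text(slides[second])) >
                Levenshtein.ratio(frame_text, ocr_text(slides[best]))):
            best = second
    return best
```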
