CAP5415-Computer Vision (FALL 2018)

 •   Sedat OZER  •   •  Home  • 

Instructor: Dr. Sedat OZER    

Class time: Tuesday/Thursday 3-4.15 pm
Class location: HEC 117
Office hours: Tuesday/Thursday 4.30-5.30 pm

COURSE GOALS: The course is introductory level computer vision course, suitable for graduate students. It will cover fundamental and modern computer vision topics. Tentative topics include:
PRE-REQUEST: Basic Probability/Statistics, a good working knowledge of any programming language (python, matlab, C/C++, or Java), Linear algebra, Vector calculus.

GRADING:Programming assignments: total30% of the final grade. There will be frequent, (almost weekly or bi-weekly) assignments.
  • The lowest graded assignment will be ignored at the end of the semester.
  • In programming assignments, it is expected that the submitted code is running without any error and generating the correct result(s) as described in the assignments. Codes giving error, will not be graded.
  • RECOMMENDED BOOKS (optional) PROGRAMMING
    Python will be main programming environment for the assignments. Following book (Python programming samples for computer vision tasks) is freely available.
    Python for Computer Vision

    Also check out both: tensorflow tutorial and keras websites.

    COLLABORATION POLICY
    Collaboration on assignments is encouraged at the level of sharing ideas and technical conversation only. Please write your own code. Students are expected to abide by UCF Golden Rule.

    LECTURE NOTES

    Lecture notes are updated weekly. I know that there are not many deep learning based slides are available (even if there are many videos available). Therefore, if you would like to access my slides in ppt format, I am happy to share. Just send me an email.

    PAPER PRESENTATION

    After the midterm, you will present your chosen paper in the class-room. This will be a group presentation relevant to your final project (you should select a paper relevant to your project).
    >
    Groups and Their 6-minutes Presentations
    Original Papers
    Michael A., Carlos A., Kartik J.
    Presentation
    Oriol Vinyals, Alexander Toshev, Samy Bengio, Dumitru Erhan, "Show and Tell: Lessons learned from the 2015 MSCOCO Image Captioning Challenge", IEEE TPAMI, 2016
    Kavin B., Rahul C., Deepak S.
    Presentation
    Octavio Arriag, Paul G. Ploger, Matias Valdenegro, "Real-time Convolutional Neural Networks for Emotion and Gender Classification", 2017
    Justin B., Timothy G., Kobee R.
    Presentation
    Phillip Isola, Jun-Yan Zhu, Tinghui Zhou, Alexei A. Efros, "Pix2Pix Image-to-Image Translation with Conditional Adversarial Networks", CVPR 2017
    Amruta K., Robert B.
    Presentation
    Waqas Sultani, Chen Chen, Mubarak Shah, "Real-world Anomaly Detection in Surveillance Videos", CVPR 2018
    Zhefu C., Sasank K.
    Presentation
    Rasmus Rothe, Radu Timofte, Luc Van Gool, "DEX: Deep EXpectation of apparent age from a single image", ICCV 2015
    Brandon K., Jack O., Madeline S.
    Presentation
    Zhe Cao, Tomas Simon, Shih-En Wei, Yaser Sheikh, "Realtime Multi-Person Pose Estimation", CVPR 2017
    Sarah S., Muhammad K.
    Presentation
    Jun-Yan Zhu, Taesung Park, Philip Isola, Alexei A. Efros, "Unpaired image-to-image translation using cycle-consistent adversarial networks"
    , ICCV 2017
    Deep K., Victoria P., Ezekiel R.
    Presentation
    Luke Metz, Ben Poole, David Pfau, Jascha Sohl-Dickstein, "Unrolled Generative Adversarial Networks" ICLR 2017
    Pei L,. Shile Z.
    Presentation
    Davy Neven, Bert De Brabandere, Stamatios Georgoulis, Marc Proesmans, Luc Van Gool, "Towards End-to-End Lane Detection: an Instance Segmentation Approach", 2018
    Abdul S., Eugene L., William Z.
    Presentation
    Wei Liu, Dragomir Anguelov, Dumitru Erhan, Christian Szegedy, Scott Reed, Cheng-Yang Fu, Alexander C. Berg, "SSD: Single Shot MultiBox Detector" ECCV 2016
    Chadwick M., Jay P., Shubham S.
    Presentation
    Alexander Sage, Eirikur Agustsson, Radu Timofte, Luc Van Gool, "Logo Synthesis and Manipulation with Clustered Generative Adversarial Networks", CVPR 2018
    Prashanth K., Nishit M.
    Presentation
    Vahid Kazemi, Ali Elqursh, "Show, Ask, Attend and Answer: A Strong Baseline for Visual Question Answering", 2017
    Jefrey H., Anantapadmanaabha P., Brandon R.
    Presentation
    Waqas Sultani, Chen Chen, Mubarak Shah, "Real-World Anomaly Detection in Surveillance Videos" CVPR 2018
    Mahfuz H., Adnan R., Dhrubo C.
    Presentation
    Jacob Buckman, Aurko Roy , Colin Raffel, Ian Goodfellow, "Thermometer Encoding: One hot way to resist Adversarial Examples", ICLR 2018
    Abdullah A., Zubayer I., Mamshadnayeem R.
    Presentation
    Zhun Zhong, Liang Zheng, Zhedong Zheng, Shaozi Li, Yi Yang, "Camera Style Adaptation for Person Re-identification", CVPR 2018
    William S., Ian H.
    Presentation
    Gui-Song Xia, Xiang Bai, Jian Ding, Zhen Zhu, Serge Belongie, Jiebo Luo, Mihai Datcu, Marcello Pelillo, Liangpei Zhang, "DOTA: A Large-scale Dataset for Object Detection in Aerial Images", CVPR 2018
    Robert S.
    Presentation
    Wei Liu, Dragomir Anguelov, Dumitru Erhan, Christian Szegedy,Scott Reed, Cheng-Yang Fu, Alexander C. Berg, "SSD: Single Shot MultiBox Detector", ECCV 2016
    Vhaijaiyanthishree V., Fnu T., Neel D.
    Presentation
    Phillip Isola, Jun-Yan Zhu, Tinghui Zhou, Alexei A. Efros, "Image-to-Image Translation with Conditional Adversarial Networks", CVPR 2017
    Rodolfo R., Mahdi Z.
    Presentation
    Bojarski M, Del Testa D, Dworakowski D, Firner B, Flepp B, Goyal P, Jackel LD, Monfort M, Muller U, Zhang J, Zhang X, "End to End Learning for Self-Driving Cars", 2016
    Gaurav Bansal, Dhruv Choudhary "Convolutional Architectures for Self Driving Cars", 2017
    Shashanka V., Stephen B., Jiheng H.
    Presentation
    Waqas Sultani, Chen Chen, Mubarak Shah, "Real-world Anomaly Detection in Surveillance Videos", CVPR 2018
    Dongdong W.
    Presentation
    Sangdoo Yun, Jongwon Choi, Youngjoon Yoo, Kimin Yun, Jin Young Choi,
    "Action-Decision Networks for Visual Tracking with Deep Reinforcement Learning", CVPR 2017
    Ou Z., Lishengsa Y.
    Presentation
    David Held, Sebastian Thrun, Silvio Savarese, "Learning to Track at 100 FPS with Deep Regression Networks", ECCV 2016
    Joseph C., Stephen W.
    Presentation
    Chen Chen, Qifeng Chen, Jia Xu, Vladlen Koltun, "Learning to See in the Dark", 2018

    Final-Project Topics

    Please download the guideline for your final projects on webcourses and follow the instructions in that file. Some ideas for your projects are:

    Contact

    Name: Sedat OZER
    Email:
    URL: http://www.sedatozer.com
    Mailing address: Dr. Sedat OZER
    Center for Research in Computer Vision (CRCV)
    4328 Scorpius Street, HEC 221, UCF

    Orlando, Florida 32816, USA.

    Last updated December, 2018 by Sedat OZER.