
Object Detection in Videos with Deep Neural Networks

TUBITAK 3501 Project

A Summary of the Project

Object detection is the problem of labeling and locating the objects in a given image. Modern object detection methods solve the problem in two stages, which we can call "search" and "recognition": in the search phase, object candidates are generated independently of class, and in the recognition phase, their classes are predicted.
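
As a rough illustration of this two-stage pipeline (not one of the project's own models), the sketch below runs a generic off-the-shelf two-stage detector, where a region proposal network plays the role of "search" and the classification head plays the role of "recognition". The model choice and tensor sizes are placeholders.

```python
# Minimal sketch of the two-stage "search" + "recognition" pipeline using an
# off-the-shelf detector (torchvision's Faster R-CNN); this is only an
# illustration, not a model developed in this project.
import torch
import torchvision

# The region proposal network inside this model performs the "search" stage;
# the box classification head performs the "recognition" stage.
model = torchvision.models.detection.fasterrcnn_resnet50_fpn(weights="DEFAULT")
model.eval()

image = torch.rand(3, 480, 640)                # placeholder RGB image tensor
with torch.no_grad():
    pred = model([image])[0]                   # dict with boxes, labels, scores

print(pred["boxes"].shape)                     # class-agnostic candidate locations
print(pred["labels"][:5], pred["scores"][:5])  # predicted classes and confidences
```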

Our project and our contributions focused on two key challenges:

(1) Context in object detection: We used global and local context in an end-to-end deep learning system to improve the recognition phase of object detection in images and videos, and we developed a deep neural network-based "generalized Hough transform" as an alternative to the "object proposal" methods used in the search phase. We showed that, when applied to object detection in images and videos, our method successfully exploits contextual information and outperforms baseline methods.

(2) Object detection in videos with referring expressions: We created a new dataset for searching objects in videos with referring expressions (e.g., "blue car on the right") and developed new methods for this task on the dataset. We showed that our methods are very successful both in detecting the objects described by complex referring expressions and in generating the most appropriate referring expression for two selected objects in a video.

Project Members

Project’s Academic Contributions

Publications

Theses

Completed:

Ongoing:

Workshop

Invited Talks

Models and Methods Developed within the Project

HoughNet

A method for object detection in images using context and the generalized Hough transform.

The corresponding paper: Samet, N., Hicsonmez, S., & Akbas, E. (2020). HoughNet: Integrating near and long-range evidence for bottom-up object detection. In European Conference on Computer Vision (pp. 406-423). link
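
A very rough, unofficial sketch of the voting idea behind HoughNet follows (the actual model uses a learned log-polar vote field; the fixed kernel, class count, and tensor shapes below are placeholders): each location's class evidence is spread as votes onto possible object centers and accumulated into a center heatmap.

```python
# Illustrative Hough-style voting: every location casts votes for where an
# object center might be, and the votes are accumulated into a heatmap.
import torch
import torch.nn.functional as F

def accumulate_votes(evidence: torch.Tensor, vote_kernel: torch.Tensor) -> torch.Tensor:
    """evidence: (B, C, H, W) per-class visual evidence scores.
    vote_kernel: (k, k) spatial pattern describing where each location's
    evidence is counted as a vote. Returns a (B, C, H, W) vote map."""
    k = vote_kernel.shape[-1]
    c = evidence.shape[1]
    # One kernel per class: depthwise convolution spreads each location's
    # evidence over the neighborhood defined by the vote kernel.
    weight = vote_kernel.view(1, 1, k, k).repeat(c, 1, 1, 1)
    return F.conv2d(evidence, weight, padding=k // 2, groups=c)

# Toy example: uniform 9x9 vote field, random "evidence" map for 80 classes.
evidence = torch.rand(1, 80, 128, 128)
vote_kernel = torch.ones(9, 9) / 81.0
center_heatmap = accumulate_votes(evidence, vote_kernel)
print(center_heatmap.shape)   # torch.Size([1, 80, 128, 128])
```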

PPDet

A method for reducing label noise in anchor-free object detectors.

The corresponding paper: Samet, N., Hicsonmez, S., & Akbas, E. (2020). Reducing Label Noise in Anchor-Free Object Detection. British Machine Vision Conference (BMVC). link
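
A rough, unofficial sketch of the prediction-pooling idea (the function name and the exact pooling and normalisation below are illustrative assumptions, not the paper's implementation): the scores of all feature-map locations assigned to the same ground-truth object are pooled into a single prediction, so that noisy locations contribute less to the training loss.

```python
# Illustrative prediction pooling for one ground-truth object.
import torch

def pooled_object_score(point_scores: torch.Tensor) -> torch.Tensor:
    """point_scores: (N,) raw class scores of the N feature-map locations
    assigned to one ground-truth object. Returns a single pooled probability."""
    probs = point_scores.sigmoid()
    # Summing (and clamping) lets confident locations dominate the pooled
    # prediction; locations with near-zero scores barely affect the loss.
    return probs.sum().clamp(max=1.0)

# Toy example: five locations inside one ground-truth box.
scores = torch.tensor([2.0, -1.0, 0.5, -3.0, 1.5])
p = pooled_object_score(scores)
loss = -torch.log(p + 1e-6)   # cross-entropy-style positive-term loss
print(p.item(), loss.item())
```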

VIREF

A method and a dataset for object search in videos using referring expressions.

The corresponding paper: Anayurt, H., Ozyegin, S. A., Cetin, U., Aktas, U., & Kalkan, S. (2019). Searching for ambiguous objects in videos using relational referring expressions. British Machine Vision Conference (BMVC). link
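
A simplified, hypothetical sketch of the search-by-expression setup follows (the module names, encoders, and dimensions are assumptions, not the VIREF architecture): the referring expression and each candidate object track are embedded into a shared space, and the tracks are ranked by similarity to the expression.

```python
# Illustrative scoring of candidate object tracks against a referring expression.
import torch
import torch.nn as nn

class ExpressionTrackScorer(nn.Module):
    def __init__(self, vocab_size=1000, text_dim=128, track_dim=256, joint_dim=128):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, text_dim)
        self.text_enc = nn.GRU(text_dim, joint_dim, batch_first=True)
        self.track_proj = nn.Linear(track_dim, joint_dim)

    def forward(self, token_ids, track_feats):
        # token_ids: (1, T) word indices of the expression, e.g. "blue car on the right"
        # track_feats: (N, track_dim) one pooled visual feature per candidate track
        _, h = self.text_enc(self.embed(token_ids))            # (1, 1, joint_dim)
        text_vec = h.squeeze(0)                                # (1, joint_dim)
        track_vecs = self.track_proj(track_feats)              # (N, joint_dim)
        return torch.cosine_similarity(track_vecs, text_vec)   # (N,) match scores

scorer = ExpressionTrackScorer()
scores = scorer(torch.randint(0, 1000, (1, 6)), torch.rand(5, 256))
print(scores.argmax().item())   # index of the best-matching track
```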

aLRP Loss

A novel ranking-based loss function for object detection.
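
As a hedged sketch of the definition (see the papers for the exact formulation and notation), the aLRP loss averages the LRP error over the positive examples, computed on the ranking that the classification scores induce over all examples:

$$
\mathcal{L}^{\mathrm{aLRP}} = \frac{1}{|\mathcal{P}|} \sum_{i \in \mathcal{P}} \ell^{\mathrm{LRP}}(i),
$$

where \(\mathcal{P}\) is the set of positive examples and \(\ell^{\mathrm{LRP}}(i)\) is the LRP error of positive \(i\) under that ranking.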

The corresponding papers:

LRP Error

A novel evaluation metric for visual detection problems.
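
Roughly (hedged; the papers give the exact weighting and notation), the LRP error combines the localisation quality of true positives with the numbers of false positives and false negatives into a single error value in \([0, 1]\):

$$
\mathrm{LRP} = \frac{1}{N_{\mathrm{TP}} + N_{\mathrm{FP}} + N_{\mathrm{FN}}}
\left( \sum_{i \in \mathrm{TP}} \frac{1 - \mathrm{IoU}_i}{1 - \tau} + N_{\mathrm{FP}} + N_{\mathrm{FN}} \right),
$$

where \(\tau\) is the IoU threshold for a detection to count as a true positive and \(\mathrm{IoU}_i\) is the IoU of the \(i\)-th true positive with its matched ground truth.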

The corresponding papers:

Contact

Emre Akbas, Dept. of Computer Engineering, METU

Last updated on March 29, 2021.