- A Summary of the Project
- Project Members
- Project’s Academic Contributions
- Models and Methods Developed within the Project
- Contact
A Summary of the Project
Object detection is the problem of labeling and locating the objects in a given image. Modern object detection methods solve the problem in two stages, which we can call as “search” and “recognition”. In the search phase, candidates for objects are determined independently of the class, and in the recognition phase, their classes are estimated.
Our project and our contributions focused on two key challenges:
(1) Context in object detection: Using global and local context in an end-to-end deep learning system to increase the performance of the recognition phase in object detection in images and videos; and a deep neural network-based “generalized Hough transform” method was developed as an alternative to the “object proposal” methods corresponding to the search phase. It has been shown that when our method is applied to object detection problems in images and videos, it successfully can exploit contextual information and produces better results than baseline methods.
(2) Object detection in videos with the referring expressions: A new dataset was created for object search with referring expressions (e.g. “blue car on the right”) in videos; and we have developed new methods for searching objects with referring expressions in this dataset. It has been shown that the methods we have developed are very successful in detecting objects corresponding to complex referring expressions and in generating the most appropriate referring expression for two selected objects in a video.
Project Members
- PI: Emre Akbaş, Dept. of Computer Engineering, METU.
- Co-PI: Sinan Kalkan, Dept. of Computer Engineering, METU
- Students: Hazan Anayurt, İlker Bozcan, Barış Can Çam, Bedrettin Çetinkaya, Gökçen Gökçeoğlu, Raheem Karim Hashmani, Nermin Samet, Kemal Öksüz, Sezai Artun Özyeğin, Abdussamet Tarık Temür.
Project’s Academic Contributions
Publications
- N. Samet, S. Hicsonmez, E. Akbas, “HoughNet: Integrating near and long-range evidence for visual detection”, IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), under review, 2021. Arxiv
- K. Oksuz, B. C. Cam, S. Kalkan, E. Akbas, “One Metric to Measure them All: Localisation Recall Precision (LRP) for Evaluating Visual Detection Tasks”, IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), under review, 2021. Arxiv, Code *Equal senior contribution.
- K. Oksuz, B. C. Cam, S. Kalkan, E. Akbas, “Imbalance Problems in Object Detection: A Review”, IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), in press, 2021. Arxiv, Publisher’s page, Repository *Equal senior contribution.
- N. Samet, S. Hicsonmez, E. Akbas, “HoughNet: Integrating near and long-range evidence for bottom-up object detection”, European Conference on Computer Vision (ECCV), 2020. Arxiv, Publisher’s page, Code.
- N. Samet, S. Hicsonmez, E. Akbas, “Reducing Label Noise in Anchor-Free Object Detection”, British Machine Vision Conference (BMVC), 2020. Arxiv, Code
- K. Oksuz, B. C. Cam, E. Akbas, S. Kalkan, “A Ranking-based, Balanced Loss Function Unifying Classification and Localisation in Object Detection”, Thirty-fourth Conference on Neural Information Processing Systems (NeurIPS), spotlight paper, 2020. Arxiv, Conference page, Code. *Equal senior contribution.
- H. Anayurt, S. A. Ozyegin, U. Cetin, U. Aktas, S. Kalkan, “Searching for Ambiguous Objects in Videos using Relational Referring Expressions”, 30th British Machine Vision Conference (BMVC), 2019. Arxiv, Code.
- K. Oksuz, B. C. Cam, E. Akbas, S. Kalkan, “Generating Positive Bounding Boxes for Balanced Training of Object Detectors”, IEEE Winter Conference on Applications of Computer Vision (WACV), 2020. Arxiv, Publisher’s page, Code. *Equal senior contribution.
- K. Oksuz, B. C. Cam, E. Akbas, S. Kalkan, “Localization Recall Precision (LRP): A New Performance Metric for Object Detection”, European Conference on Computer Vision (ECCV), pp. 521-537, Springer, 2018. Arxiv, Publisher’s page, Code.
Theses
Completed:
- Bedrettin Cetinkaya, “Does estimated depth help object detection?”, MSc Thesis, Dept. of Computer Engineering, METU, 2020. Thesis copy
- Baris Can Cam, “Training object detectors by directly optimizing LRP metric”, MSc Thesis, Dept. of Computer Engineering, METU, 2020. Thesis copy
- Kemal Öksüz, “Identifying and Addressing Imbalance Problems in Visual Detection”, PhD Thesis, Dept. of Computer Engineering, METU, 2021.
Ongoing:
- Hazan Anayurt, MSc Thesis, Dept. of Computer Engineering, METU, 2021.
- Sezai Artun Özyeğin, MSc Thesis, Dept. of Computer Engineering, METU, 2021.
- Nermin Samet, PhD Thesis, Dept. of Computer Engineering, METU, 2021.
Workshop
Invited Talks
- TBC.
Models and Methods Developed within the Project
HoughNet
A method for object detection in images using context and generalized hough transform.
The corresponding paper: Samet, N., Hicsonmez, S., & Akbas, E. (2020). HoughNet: Integrating near and long-range evidence for bottom-up object detection. In European Conference on Computer Vision (pp. 406-423). link
PPDet
A method for reducing label noise in anchor-free object detectors.
The corresponding paper: Samet, N., Hicsonmez, S., & Akbas, E. (2020). Reducing Label Noise in Anchor-Free Object Detection. British Machine Vision Conference (BMVC). link
VIREF
A method and a dataset for object search in videos using referring expressions.
The corresponding paper: Anayurt, H., Ozyegin, S. A., Cetin, U., Aktas, U., & Kalkan, S. (2019). Searching for ambiguous objects in videos using relational referring expressions. British Machine Vision Conference (BMVC). link
aLRP Loss
A novel ranking based loss function for object detection.
The corresponding papers:
-
K. Oksuz, B. C. Cam, E. Akbas, S. Kalkan, “A Ranking-based, Balanced Loss Function Unifying Classification and Localisation in Object Detection”, Thirty-fourth Conference on Neural Information Processing Systems (NeurIPS), spotlight paper, 2020. Arxiv, Conference page, Code. *Equal senior contribution.
-
K. Oksuz, B. C. Cam, S. Kalkan, E. Akbas, “One Metric to Measure them All: Localisation Recall Precision (LRP) for Evaluating Visual Detection Tasks”, IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), under review, 2021. Arxiv, Code *Equal senior contribution.
LRP Error
A novel evaluation metric for visual detection problems.
The corresponding papers:
-
K. Oksuz, B. C. Cam, S. Kalkan, E. Akbas, “One Metric to Measure them All: Localisation Recall Precision (LRP) for Evaluating Visual Detection Tasks”, IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), under review, 2021. Arxiv, Code *Equal senior contribution.
-
K. Oksuz, B. C. Cam, E. Akbas, S. Kalkan, “Localization Recall Precision (LRP): A New Performance Metric for Object Detection”, European Conference on Computer Vision (ECCV), pp. 521-537, Springer, 2018. Arxiv, Publisher’s page, Code.
Contact
Emre Akbas, Dept. of Computer Engineering, METU
Last updated on March 29, 2021.