TY - GEN
T1 - Cracking BING and beyond
AU - Zhao, Qiyang
AU - Liu, Zhibin
AU - Yin, Baolin
N1 - Publisher Copyright:
© 2014. The copyright of this document resides with its authors.
PY - 2014
Y1 - 2014
N2 - Objectness proposal is an emerging field which aims to reduce candidate object windows without missing the real ones. Under the evaluation framework of DR-#WIN, lots of methods report good performance in recent years, and the best is BING in CVPR 2014. BING provides good detection rates and surprisingly high efficiencies. But what can we benefit from it from the view of computer vision research? In this paper, we show that the success of BING is rather in combinatorial geometry than in computer vision research. The secret lies in the Achilles' heel of the DR-#WIN evaluation framework: the 0:5-INT-UNION criterion. We proposed a method to construct a rather small set of windows to "cover" all legal rectangles. On images no larger than 512×512, supposing all object rectangles are not smaller than 16×16, nearly 19K windows are sufficient to cover all possible rectangles. The amount is far less than that of all sliding windows. It can be reduced further by exploiting the prior distribution of the locations and sizes of object rectangles in a greedy way. We also proposed a hybrid scheme blending both greedy and stochastic results. On the VOC2007 test set, it recalls 95:68% objects with 1000 proposal windows. The detection rates on the first ten windows are 13:99% ∼ 40:29% higher than earlier methods in average.
AB - Objectness proposal is an emerging field which aims to reduce candidate object windows without missing the real ones. Under the evaluation framework of DR-#WIN, lots of methods report good performance in recent years, and the best is BING in CVPR 2014. BING provides good detection rates and surprisingly high efficiencies. But what can we benefit from it from the view of computer vision research? In this paper, we show that the success of BING is rather in combinatorial geometry than in computer vision research. The secret lies in the Achilles' heel of the DR-#WIN evaluation framework: the 0:5-INT-UNION criterion. We proposed a method to construct a rather small set of windows to "cover" all legal rectangles. On images no larger than 512×512, supposing all object rectangles are not smaller than 16×16, nearly 19K windows are sufficient to cover all possible rectangles. The amount is far less than that of all sliding windows. It can be reduced further by exploiting the prior distribution of the locations and sizes of object rectangles in a greedy way. We also proposed a hybrid scheme blending both greedy and stochastic results. On the VOC2007 test set, it recalls 95:68% objects with 1000 proposal windows. The detection rates on the first ten windows are 13:99% ∼ 40:29% higher than earlier methods in average.
UR - https://www.scopus.com/pages/publications/84919779593
M3 - 会议稿件
AN - SCOPUS:84919779593
SN - 1901725529
T3 - BMVC 2014 - Proceedings of the British Machine Vision Conference 2014
BT - BMVC 2014 25th British Machine Vision Conference 2014
A2 - Valstar, Michel
A2 - French, Andrew
A2 - Pridmore, Tony
PB - British Machine Vision Association, BMVA
T2 - 25th British Machine Vision Conference, BMVC 2014
Y2 - 1 September 2014 through 5 September 2014
ER -