中国计算机学会青年计算机科技论坛
CCF YOCSEF
于2014年12月1日(星期一)10:15-11:45
在南开大学(伯苓楼东区306会议室)
研讨会主题
Shape Knowledge in Segmentation and Tracking
10:15 签到
10:30 报告会开始
特邀讲者:Victor Adrian Prisacariu 博士,研究员,英国牛津大学
报告题目:Shape Knowledge in Segmentation and Tracking
执行主席:刘晓光 博士, 教授,南开大学,CCF YOCSEF天津主席
执行主席:程明明 博士,副教授,南开大学
执行主席:杨巨峰 博士,副教授,南开大学,CCF YOCSEF天津AC委员
Victor Adrian Prisacariu 博士,研究员
(1)报告内容简介:
In the talk I will detail methods for simultaneous 2D/3D segmentation, tracking and reconstruction in highly dynamic environments, which incorporate high level shape information.
I base my work on the assumption that the space of possible 2D object shapes can be either generated by projecting down known rigid 3D shapes or learned from 2D shape examples. I minimize the discrimination between statistical foreground and background appearance models with respect to the parameters governing the shape generative process (the 6 degree-of-freedom 3D pose of the 3D shape or the parameters of the learned space). The foreground region is delineated by the zero level set of a signed distance function, and I define an energy over this region and its immediate background surroundings based on pixel-wise posterior membership probabilities. I obtain the differentials of this energy with respect to the parameters governing shape and conduct searches for the correct shape using standard non-linear minimization techniques.
This methodology first leads to a novel rigid 3D object tracker. For a known 3D shape, the optimization here aims to find the 3D pose that leads to the 2D projection that best segments a given image. I also show how the approach could be accelerated to a point where real time processing on a mobile phone becomes possible.
Next, I explore deformable 2D/3D object tracking. I use a non-linear and probabilistic dimensionality reduction, called Gaussian Process Latent Variable Models, to learn spaces of shape. Segmentation becomes a minimization of an image-driven energy function in the learned space. I can represent both 2D and 3D shapes which I compress with Fourier-based transforms, to keep inference tractable.
Finally, I will also discuss various applications of the proposed techniques, ranging from 3D reconstruction on a mobile phone, to semantic SLAM and to objectness proposals.
(2)个人简介:
Dr. Victor Adrian Prisacariu is a researcher in