TY - GEN
T1 - Model-based pose estimation by consensus
AU - Jorstad, Anne
AU - Burlina, Philippe
AU - Wang, I. Jeng
AU - Lucarelli, Dennis
AU - DeMenthon, Daniel
PY - 2008/12/1
Y1 - 2008/12/1
N2 - We present a system for determining a consensus estimate of the pose of an object, as seen from multiple cameras in a distributed network. The cameras are pointed towards a 3D object defined by a configuration of points, which are assumed to be visible and detected in all camera images. The cameras are given a model defining the 3D configuration of these object points, but do not know which image point corresponds to which object point. Each camera estimates the pose of the object, then iteratively exchanges information with its neighbors to arrive at a common decision of the pose over the network. We consider eight variations of the consensus algorithm, and find that each converges to a more accurate result than do the individual cameras alone on average. The method exchanging 3D world coordinates penalized to agree with the input model provides the most accurate results. If bandwidth is limited, performing consensus over rotations and translations requires cameras to exchange only the six values specifying the six degrees of freedom of the object pose, and performing consensus in SE(3) using the Karcher mean is generally the best choice. We show further that interleaving pose calculation with the consensus iterations improves the final result when the image noise is large.
AB - We present a system for determining a consensus estimate of the pose of an object, as seen from multiple cameras in a distributed network. The cameras are pointed towards a 3D object defined by a configuration of points, which are assumed to be visible and detected in all camera images. The cameras are given a model defining the 3D configuration of these object points, but do not know which image point corresponds to which object point. Each camera estimates the pose of the object, then iteratively exchanges information with its neighbors to arrive at a common decision of the pose over the network. We consider eight variations of the consensus algorithm, and find that each converges to a more accurate result than do the individual cameras alone on average. The method exchanging 3D world coordinates penalized to agree with the input model provides the most accurate results. If bandwidth is limited, performing consensus over rotations and translations requires cameras to exchange only the six values specifying the six degrees of freedom of the object pose, and performing consensus in SE(3) using the Karcher mean is generally the best choice. We show further that interleaving pose calculation with the consensus iterations improves the final result when the image noise is large.
UR - http://www.scopus.com/inward/record.url?scp=63149109540&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=63149109540&partnerID=8YFLogxK
U2 - 10.1109/ISSNIP.2008.4762050
DO - 10.1109/ISSNIP.2008.4762050
M3 - Conference contribution
AN - SCOPUS:63149109540
SN - 9781424429578
T3 - ISSNIP 2008 - Proceedings of the 2008 International Conference on Intelligent Sensors, Sensor Networks and Information Processing
SP - 569
EP - 574
BT - ISSNIP 2008 - Proceedings of the 2008 International Conference on Intelligent Sensors, Sensor Networks and Information Processing
T2 - 2008 International Conference on Intelligent Sensors, Sensor Networks and Information Processing, ISSNIP 2008
Y2 - 15 December 2008 through 18 December 2008
ER -