POMDP Reading Group

Spring 2009

Meetings on Monday 4 - 6pm, at Adaptive Computing Lab, COM1.

This is the list of papers that have been proposed for reading and which have not been scheduled yet. The papers that have been scheduled follow below.

Continuous POMDPs
(refer to schedule below)

Factored and other compact state-space representations
Tractable Inference for Complex Stochastic Processes
Xavier Boyen and Daphne Koller. Uncertainty in Artificial Intelligence UAI '98, 1998.

Approximate Planning for Factored POMDPs
Z. Feng and E. Hansen. Sixth European Conference on Planning (ECP-01), Toledo, Spain, September 2001.

Dynamic Programming for POMDPs using a Factored State Representation
E. Hansen and Z. Feng. Fifth International Conference on Artificial Intelligence Planning and Scheduling (AIPS-00), Breckenridge, Colorado, April 2000.

Computing Optimal Policies for Partially Observable Decision Processes using Compact Representations
Craig Boutilier, David Poole. Proceedings of the Thirteenth National Conference on Artificial Intelligence (AAAI-96)

Value-directed Compression of POMDPs
Pascal Poupart and Craig Boutilier. Advances in Neural Information Processing Systems 15 (NIPS), pages 1547-1554, Vancouver, BC, 2002.

Finding Approximate POMDP solutions Through Belief Compression
N. Roy, G. Gordon and S. Thrun. Journal of Artificial Intelligence Research, 23: 1-40, 2005.

(alternative to the above)
Exponential Family PCA for Belief Compression in POMDPs
N. Roy and G. Gordon. Advances in Neural Information Processing (15) NIPS, Vancouver, BC. Dec. 2002

A Novel Orthogonal NMF-Based Belief Compression for POMDPs
Xin Li, William K. W. Cheung, Jiming Liu, Zhili Wu. ICML 2007.

Hierarchical
Approximate Planning with Hierarchical Partially Observable Markov Decision Processes for Robot Navigation
Georgios Theocharous and Sridhar Mahadevan. IEEE Conference on Robotics and Automation (ICRA), Washington, D.C. May 2002.

Synthesis of hierarchical finite-state controllers for POMDPs
E. Hansen, R. Zhou. Proceedings of the Thirteenth International Conference on Automated Planning and Scheduling. 2003.

HQ-Learning
Wiering and Schmidhuber. Adaptive Behavior.1997; 6: 219-246.

Model minimization in Markov decision processes
T. Dean, R. Givan. Proceedings of the Fourteenth National Conference on Artificial Intelligence, AAAI, 1997, pp. 106-111.

Applications
(refer to schedule below)

Interactive-POMDP
A Framework for Sequential Planning in Multiagent Settings
Piotr Gmytrasiewicz and Prashant Doshi, Journal of AI Research, Vol 24: 49-79, 2005.
(or conference version:
A Framework for Sequential Planning in Multi-Agent Settings
Piotr Gmytrasiewicz and Prashant Doshi, Proceedings of the Eighth International Symposium on Artificial Intelligence and Mathematics, January 4-6, 2004.)

Decentralized-POMDP
(refer to schedule below)

Meeting Schedule

 Date                            Topic Presenter Remarks
28 Feb
08
Applications

Assisting Persons with Dementia during Handwashing Using a Partially Observable Markov Decision Process
Jesse Hoey, Axel von Bertoldi, Pascal Poupart, and Alex Mihailidis. Proceedings of the Intl Conf on Vision Systems (ICVS), Biefeld, Germany, 2007.

Value-Directed Human Behavior Analysis from Video Using Partially Observable Markov Decision Processes
Jesse Hoey and James J. Little. IEEE PAMI, Vol 29, No 7, pp.1118-1132, July 2007.

Sylvie Ong Discuss papers to be scheduled for upcoming meetings
13 Mar
08
Factored Belief Representations

Solving Factored POMDPs with Linear Value Functions
Carlos Guestrin, Daphne Koller, Ronald Parr. IJCAI-01 workshop on Planning under Uncertainty and Incomplete Information, 2001.

Approximate Planning for Factored POMDPs using Belief State Simplification
David A. McAllester, Satinder Singh. UAI 99.

Lee Wee Sun    
3 April
08
Hierarchical

Policy-contingent abstraction for robust robot control
J. Pineau, G. Gordon & S. Thrun. Conference on Uncertainty in Articifical Intelligence (UAI). Acapulco, Mexico. pp. 477-484. Aug. 2003.

Automated Hierarchy Discovery for Planning in Partially Observable Environments
Laurent Charlin, Pascal Poupart and Romy Shioda. Advances in Neural Information Processing Systems 19 (NIPS), Vancouver, BC, 2006.

Amit Discuss papers to be scheduled for upcoming meetings
24 April
08
Decentralized POMDPs

Bounded Policy Iteration for Decentralized POMDPs
Daniel S. Bernstein, Eric A. Hansen, and Shlomo Zilberstein. Proceedings of the 19th International Joint Conference on Artificial Intelligence (IJCAI-05), Edinburgh, Scotland, July 2005.

Improved Memory-Bounded Dynamic Programming for Decentralized POMDPs
Sven Seuken and Shlomo Zilberstein. Proceedings of the Twenty-Third Conference on Uncertainty in Artificial Intelligences (UAI-07), Vancouver, BC, Canada, July, 2007.

Hanna Kur Discuss papers to be scheduled for upcoming meetings
08 May
08
Continuous POMDPs

Point-based Value Iteration for Continuous POMDPs
Josep M. Porta, Nikos Vlassis, Matthijs T.J. Spaan, Pascal Poupart. Journal of Machine Learning
Research 7(Nov):2329--2367, 2006.

Parametric POMDPs for Planning in Continuous State Spaces
A. Brooks, A. Makarenko, S.B. Williams, & H.F. Durrant-Whyte. Robotics and Autonomous Systems, vol. 54, no. 11, 2006, pp. 887-897.

David Hsu Discuss papers to be scheduled for upcoming meetings
29 May
08
Policy Iteration

Solving POMDPs by Searching in Policy Space
Eric Hansen. UAI-98.

Point-Based Policy Iteration.
Shihao Ji, Ronald Parr et al. AAAI 2007.

Hanna Kur    
19 June
08
Policy Iteration

Solving POMDPs by Searching the Space of Finite Policies
Nicolas Meuleau, Kee-Eung Kim, Leslie Pack Kaelbling, Anthony R. Cassandra. Proc. of the Conf. on Uncertainty in AI, 1999.

PEGASUS: A policy search method for large MDPs and POMDPs.
Andrew Y. Ng, Michael Jordan. Proc. of the Conf. on Uncertainty in AI, 2000.

Sylvie Ong    
16 Feb
09
ADDs part 1

Symbolic Boolean Manipulation with Ordered Binary Decision Diagrams
R. E. Bryant. ACM Computing Surveys, Vol. 24, No. 3 (September, 1992), pp. 293–318.

Algebraic Decision Diagrams and Their Applications
R.I. Bahar; E.A. Frohm; C.M. Gaona; G.D. Hachtel; E. Macii; and A. Pardo.
International Conference on CAD (1993), 188–191.

Graph-based algorithms for boolean function manipulation.
R. E. Bryant. IEEE Transactions on Computers 35(8):677–691, 1986.

Efficient ADD Operations for Point-Based Algorithms
Guy Shani, Ronen I. Brafman, Solomon E. Shimony and Pascal Poupart.
Proceedings of the Eighteenth International Conference on Automated Planning and Scheduling (ICAPS),
Sydney, Australia, 2008.

       

Adaptive Intelligence Reading Group

Spring 2005

The semester's reading is centered around techniques to allow a mobile robot to automatically map the environment, recognize human activities and decide the appropriate action to achieve its objectives.

 Date                            Topic Presenter Remarks
18 March Estimate where you are: Kalman filtering Amit  
1 April Estimate where you are: Particle filtering Nikhila  
15 April Estimate where you are: Expectation Maximization (EM) Wee Sun The method is essentially running EM on a HMM. Those not familiar with HMM may want to have a look at the HMM notes from 29 April.
29 April Recognizing activities: Hidden Markov Model (HMM), Conditional Random Field (CRF )
Lecture notes, Lee Wee Sun
http://www-2.cs.cmu.edu/~lafferty/ps/crf.ps
Hai Leong's ICML submission
Hai Leong  
13 May Decide what you do: MDP and POMDP Hanna  
27 May Decide what you do: on-line decision making Xiaoxi  
login · edit · upload · history · print
Page last modified on February 06, 2009, at 05:06 PM