I am looking for the application of open-loop feedback control to determine sub-optimal policies for partially observed Markov decision process. The applications normally focus on finite-horizon POMDP. I am looking for some text on the application of open-loop feedback control for infinite horizon POMDP along with some known structural results on the sub-optimal policies. Thanks
asked
anon123 |