語系:
繁體中文
English
說明(常見問題)
回圖書館首頁
手機版館藏查詢
登入
回首頁
切換:
標籤
|
MARC模式
|
ISBD
The interplay between information an...
~
Gorantla, Siva Kumar.
FindBook
Google Book
Amazon
博客來
The interplay between information and control theory within interactive decision-making problems.
紀錄類型:
書目-電子資源 : Monograph/item
正題名/作者:
The interplay between information and control theory within interactive decision-making problems./
作者:
Gorantla, Siva Kumar.
出版者:
Ann Arbor : ProQuest Dissertations & Theses, : 2012,
面頁冊數:
137 p.
附註:
Source: Dissertations Abstracts International, Volume: 75-02, Section: B.
Contained By:
Dissertations Abstracts International75-02B.
標題:
Information Technology. -
電子資源:
http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=3570475
ISBN:
9781303156199
The interplay between information and control theory within interactive decision-making problems.
Gorantla, Siva Kumar.
The interplay between information and control theory within interactive decision-making problems.
- Ann Arbor : ProQuest Dissertations & Theses, 2012 - 137 p.
Source: Dissertations Abstracts International, Volume: 75-02, Section: B.
Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 2012.
This item must not be added to any third party search indexes.
The context for this work is two-agent team decision systems. An agent is an intelligent entity that can measure some aspect of its environment, process information and possibly influence the environment through its action. In a colloborative two-agent team decision system, the agents can be coupled by noisy or noiseless interactions and cooperate to solve problems that are beyond the individual capabilities or knowledge of each agent. This thesis focuses on using stochastic control and information theoretic tools hand-in-hand in solving and analyzing an interactive two-agent sequential decision-making problem. Stochastic control techniques can help in identifying optimal strategies for sequential decision making based on observations. Information-theoretic tools address the fundamental limit of performance between two agents with noisy interaction - in the context of communication and rate-distortion. The motivation for this work comes from the quest for using stochastic control tools in identifying optimal policies for a two-agent team decision system with an objective of maximizing the information rate. The resulting policies, if they exist, will involve decision making at each step based on observations, in contrast to existing communication schems that decide what to transmit over a long time-horizon, at the start of communication. However, there are many questions that have to be addressed: • How should we formulate a stochastic-control problem to capture information gains? • Suppose we can formulate such a control problem. Can we solve for explicit, non-random, optimal strategies that operate on sufficient statistics (thus resulting in a simple structure for optimal policies)? • Further, do these control-theory based policies assure reliability of communication in an information-theoretic sense? Consider a different problem where a third person has knowledge of the optimal policies of the two interacting agents, but is unaware of the cost function they are colloboratively optimizing. • Can he deduce what the two agents are trying to achieve based on their policies? In this thesis, we focus on addressing these questions using perspectives from both information and control theory. We consider an interacting two-agent decision-making problem consisting of a Markov source process, a causal encoder with feedback, and a causal decoder. We augment the standard formulation by considering general alphabets and a non-trivial cost function operating on current and previous symbols; this enables us to introduce the 'sequential information gain cost' function that can capture information gains accumulated at each time step. We emphasize how this problem formulation leads to a different style of coding scheme with a control-theoretic flavor. Further, we solve for structural results on these optimal policies using dynamic programming principles. We then demonstrate another interplay between information theory and control theory, at the level of reliability of message-point communication schemes, by establishing a relationship between reliability in feedback communication to the stability of the posterior belief's nonlinear filter. We also consider the two-agent inverse optimal control (IOC) problem, where a fixed policy satisfying certain statistical conditions is shown to be optimal for some cost function, using probabilistic matching. We provide examples of the applicability of this framework to communication with feedback, hidden Markov models and the nonlinear filter, decentralized control, brain-machine interfaces, and queuing theory.
ISBN: 9781303156199Subjects--Topical Terms:
1030799
Information Technology.
The interplay between information and control theory within interactive decision-making problems.
LDR
:04764nmm a2200325 4500
001
2206881
005
20190906083309.5
008
201008s2012 ||||||||||||||||| ||eng d
020
$a
9781303156199
035
$a
(MiAaPQ)AAI3570475
035
$a
AAI3570475
040
$a
MiAaPQ
$c
MiAaPQ
100
1
$a
Gorantla, Siva Kumar.
$3
3433804
245
1 4
$a
The interplay between information and control theory within interactive decision-making problems.
260
1
$a
Ann Arbor :
$b
ProQuest Dissertations & Theses,
$c
2012
300
$a
137 p.
500
$a
Source: Dissertations Abstracts International, Volume: 75-02, Section: B.
500
$a
Publisher info.: Dissertation/Thesis.
500
$a
Coleman, Todd P.
502
$a
Thesis (Ph.D.)--University of Illinois at Urbana-Champaign, 2012.
506
$a
This item must not be added to any third party search indexes.
506
$a
This item must not be sold to any third party vendors.
520
$a
The context for this work is two-agent team decision systems. An agent is an intelligent entity that can measure some aspect of its environment, process information and possibly influence the environment through its action. In a colloborative two-agent team decision system, the agents can be coupled by noisy or noiseless interactions and cooperate to solve problems that are beyond the individual capabilities or knowledge of each agent. This thesis focuses on using stochastic control and information theoretic tools hand-in-hand in solving and analyzing an interactive two-agent sequential decision-making problem. Stochastic control techniques can help in identifying optimal strategies for sequential decision making based on observations. Information-theoretic tools address the fundamental limit of performance between two agents with noisy interaction - in the context of communication and rate-distortion. The motivation for this work comes from the quest for using stochastic control tools in identifying optimal policies for a two-agent team decision system with an objective of maximizing the information rate. The resulting policies, if they exist, will involve decision making at each step based on observations, in contrast to existing communication schems that decide what to transmit over a long time-horizon, at the start of communication. However, there are many questions that have to be addressed: • How should we formulate a stochastic-control problem to capture information gains? • Suppose we can formulate such a control problem. Can we solve for explicit, non-random, optimal strategies that operate on sufficient statistics (thus resulting in a simple structure for optimal policies)? • Further, do these control-theory based policies assure reliability of communication in an information-theoretic sense? Consider a different problem where a third person has knowledge of the optimal policies of the two interacting agents, but is unaware of the cost function they are colloboratively optimizing. • Can he deduce what the two agents are trying to achieve based on their policies? In this thesis, we focus on addressing these questions using perspectives from both information and control theory. We consider an interacting two-agent decision-making problem consisting of a Markov source process, a causal encoder with feedback, and a causal decoder. We augment the standard formulation by considering general alphabets and a non-trivial cost function operating on current and previous symbols; this enables us to introduce the 'sequential information gain cost' function that can capture information gains accumulated at each time step. We emphasize how this problem formulation leads to a different style of coding scheme with a control-theoretic flavor. Further, we solve for structural results on these optimal policies using dynamic programming principles. We then demonstrate another interplay between information theory and control theory, at the level of reliability of message-point communication schemes, by establishing a relationship between reliability in feedback communication to the stability of the posterior belief's nonlinear filter. We also consider the two-agent inverse optimal control (IOC) problem, where a fixed policy satisfying certain statistical conditions is shown to be optimal for some cost function, using probabilistic matching. We provide examples of the applicability of this framework to communication with feedback, hidden Markov models and the nonlinear filter, decentralized control, brain-machine interfaces, and queuing theory.
590
$a
School code: 0090.
650
4
$a
Information Technology.
$3
1030799
650
4
$a
Electrical engineering.
$3
649834
690
$a
0489
690
$a
0544
710
2
$a
University of Illinois at Urbana-Champaign.
$b
Electrical and Computer Engineering.
$3
3182633
773
0
$t
Dissertations Abstracts International
$g
75-02B.
790
$a
0090
791
$a
Ph.D.
792
$a
2012
793
$a
English
856
4 0
$u
http://pqdd.sinica.edu.tw/twdaoapp/servlet/advanced?query=3570475
筆 0 讀者評論
館藏地:
全部
電子資源
出版年:
卷號:
館藏
1 筆 • 頁數 1 •
1
條碼號
典藏地名稱
館藏流通類別
資料類型
索書號
使用類型
借閱狀態
預約狀態
備註欄
附件
W9383430
電子資源
11.線上閱覽_V
電子書
EB
一般使用(Normal)
在架
0
1 筆 • 頁數 1 •
1
多媒體
評論
新增評論
分享你的心得
Export
取書館
處理中
...
變更密碼
登入