The document discusses an approach called the Integrated Influence Architecture for planning agent execution in dynamic environments. It generates "influence landscapes" from multiple data sources to allow agents to react deliberatively or deliberate reactively based on a continuous range of stimuli. Key components include stacks that produce landscapes from domain structure, environmental data, and plan data. The unified landscape guides agent decisions while allowing loose conformity to plans for flexibility.
14. Less well suited to environments where things are changing outside the agent's control, and fast-paced environments.
15.
16. Not too hard to remodel the world based on current observations and rethink, but it is time consuming.
17. That also assumes that the model is rich enough to capture unforeseen consequences.
18. For reactive agents, things are rarely going to go tragically wrong, but in dealing with "threats", agent may not achieve objectives.
19.
20. It isn't enough to label some aspects as being handled by a reactive component and others by a deliberative - the interactions of the two can still be disruptive to goal satisfaction.
21. We can't reason fast enough to make near-real-time deliberative decisions.
22.
23. The basic approach is to allow an agent to make decisions based on a continuous range of stimuli from a mixture of sources.
24. This will allow the agent to react deliberatively, or deliberate reactively.
25.
26.
27. Developers aim for 60fps execution - each frame gets 16ms of CPU time, but most of this is graphics processing etc. AI decisions might expect to get at most 1ms total for every agent.
57. Given a goal node, it propagates influence across a DTG, providing information for the shortest path through the DTG as well as highlighting the existence of alternate routes.
58. A naïve baseline for manipulating the environment, which other stacks then modify with more detailed information.
59.
60. It allows for updates to the perceived value of a state, and for these value alterations to be propagated out as influence to allow an agent to exploit opportunities or avoid dangers.
61.
62.
63.
64. In general Roadblocks do not signify an alteration to the physical characteristics of the world, rather an addition of other characteristics.
65. E.g. Actual roadblocks - the road still exists, and maybe will be available again within the lifecycle of the agent.
66.
67. This is where the majority of reasoning is done, and issues such as Causal satisfaction are primarily handled.
68. Forms the basis of the execution - with no additional data and no problems, the plan should be what the agent executes.
69.
70. Deviation from the plan should be permissible and achievable if necessary.
71. Imagining the plan as a trajectory through the space, do we want to strongly describe the trajectory as a ridge through the space, or weakly describe it as a set of waypoints.
72. Allowing loose conformity through weak description gives scope for deviation and alternate routes being found.
73.
74. Based on SAS+ notation we know that nodes can be grouped together, such as cities representing the UK and EU.
75. Clustering allows us to build these groupings automatically (although not necessarily as obviously as by inspection)
76.
77. Calculate the centroid of the cluster based on the centroid being the node with the smallest average weighted distance to each node within the cluster.
84. These are the waypoints that we use to form a landscape that loosely conforms to the original plan, and these nodes radiate influence to form the Plan Data Landscape.
85.
86. In the initial implementation, these are simply summed to find the overall value of each node in the landscape.
89. The Stacks are independent of each other, so can run asynchronously to update the landscape, and can be parallelised.
90. Much of the heavy-lifting can be done offline, meaning that the execution-time components are kept very simple.
91.
92. Domain structure analysis means that the agent can see alternative routes to the intended node, when the total cost of a route becomes too great, other routes will be used.
93. Environmental data means that the agent can be influenced by what is happening around it and can react to a dynamic environment.
94.
95. Still not entirely satisfied that the specific details laid out here (or glossed over) are completely appropriate.
102. Need to have a much more robust implementation in place, and ideally be built into an actual application.
103. Want to show the extensibility of the approach by developing new Stacks such as an Opponent Modelling Stack to highlight expectation of other agent's actions and how this can be factored in as an additional landscape.