tfmdp.model.cell package

Submodules

tfmdp.model.cell.basic_cell module

class tfmdp.model.cell.basic_cell.BasicMarkovCell(compiler: rddl2tf.compiler.Compiler, policy: tfmdp.policy.drp.DeepReactivePolicy, config: Optional[Dict] = None)

Bases: tensorflow.python.ops.rnn_cell_impl.RNNCell

BasicMarkovCell implements a 1-step MDP transition function as an RNNCell whose hidden state is the MDP current state and output is a tuple with next state, action, intermediate fluents, and reward.

Parameters:
  • compiler (rddl2tf.compiler.Compiler) – RDDL2TensorFlow compiler.
  • config (Dict) – The cell configuration parameters.
__call__(inputs: tensorflow.python.framework.ops.Tensor, state: Sequence[tensorflow.python.framework.ops.Tensor], scope: Optional[str] = None) → Tuple[Tuple[Sequence[tensorflow.python.framework.ops.Tensor], Sequence[tensorflow.python.framework.ops.Tensor], Sequence[tensorflow.python.framework.ops.Tensor], tensorflow.python.framework.ops.Tensor], Sequence[tensorflow.python.framework.ops.Tensor]]

Returns the cell’s output tuple and next state tensors.

Output tuple packs together the next state, action, interms, and reward tensors in order.

Parameters:
  • inputs (tf.Tensor) – The timestep input tensor.
  • state (Sequence[tf.Tensor]) – The current state tensors.
  • scope (Optional[str]) – The cell name scope.
Returns:

A pair with the cell’s output tuple and next state.

Return type:

(CellOutput, CellState)

action_size

Returns the MDP action size.

graph

Returns the cell’s computation graph.

interm_size

Returns the MDP intermediate state size.

output_size

Returns the simulation cell output size.

state_size

Returns the MDP state size.

class tfmdp.model.cell.basic_cell.OutputTuple(state, action, interms, reward)

Bases: tuple

__getnewargs__()

Return self as a plain tuple. Used by copy and pickle.

static __new__(_cls, state, action, interms, reward)

Create new instance of OutputTuple(state, action, interms, reward)

__repr__()

Return a nicely formatted representation string

_asdict()

Return a new OrderedDict which maps field names to their values.

classmethod _make(iterable, new=<built-in method __new__ of type object>, len=<built-in function len>)

Make a new OutputTuple object from a sequence or iterable

_replace(**kwds)

Return a new OutputTuple object replacing specified fields with new values

action

Alias for field number 1

interms

Alias for field number 2

reward

Alias for field number 3

state

Alias for field number 0

tfmdp.model.cell.basic_cell.cell_size(sizes: Sequence[Sequence[int]]) → Sequence[Union[Sequence[int], int]]
tfmdp.model.cell.basic_cell.to_tensor(fluents)

Module contents