Getting started¶

Installation¶

pip install aind-behavior-curriculum

Documentation¶

Understanding a Curriculum¶

A Curriculum is structured as a graph of training Stage. Each Stage is associated with a Task, which defines a set of configuration parameters via TaskParameters. It is important to note that Stage wraps an INSTANCE of Task, NOT the class itself. In other words, Stage is a container for a Task that has been instantiated with a respective TaskParameters.

Stages are connected by StageTransition, which are directed edges associated with a trigger condition.

Stage and StageTransition form the nodes and edges of a Curriculum graph, respectively. With this structure alone, a user can define a basic curriculum with the flexibility of defining skip connections and regressions. For nodes with multiple ongoing edges, edges are labelled by priority, set by the user.

High-Level Curriculum

An example curriculum consisting of purely stages and stage transitions. This Curriculum consists of a skip connection between Stage ‘StageA and Stage ‘Graduated’. StageTransition are triggered on a parameter ‘t2’ and the skip transition is ordered before the transition going to Stage StageB.

\(~\)

Stages are intended to represent ‘checkpoint learning objectives’, which wrap independent sets of parameters, for example, Stage1 = {P1, P2, P3} -> Stage2 = {P4, P5, P6}.

If a curriculum demands changing the same set of parameters, for example, Stage1 = {P1, P2, P3} -> Stage1’ = {P1’, P2’, P3}, it is a good idea to use PolicyGraphs.

A PolicyGraph is a parallel programming interface for changing Stage parameters.

Full Curriculum

An example Curriculum consisting of Stage and Policy graphs. Left: The high level policy graph. Right: Internal policy graphs.

Track Curriculum

A ‘Track’ Curriculum

\(~\)

A PolicyGraph consists of Policy nodes and PolicyTransition directed edges. Policies are user-defined functions that take in the current Stage TaskParameters and return the updated Stage TaskParameters. PolicyTransitions define conditional execution of downstream Policies. Like StageTransition, PolicyTransition can connect any two arbitrary Policy and are ordered by priority set by the user. The yellow polices indicate Start policies, which are entrypoint(s) into the PolicyGraph specified by the user. Altogether, Policies and PolicyTransitions may be assembled to form arbitrary execution trees and loops.

Notably, PolicyGraph is executed in parallel (execution is done by the Trainer, discussed later). A mouse may occupy multiple policies at once and will traverse down all trigger transitions returning True, similar to current in a circuitboard. While a mouse can only occupy one Stage at a time, a mouse can and will often occupy many active policies. Intuitively, the current state of Stage parameters is the net parameter change of all active policies.

Parallel execution has the benefit of supporting asynchronous parameter updates, which is a more natural way of defining parameter changes. Rather than defining how all stage parameters all change as a group, a policy can instead define updates to individual parameters, which asynchronously trigger on different metrics.

A good example of using PolicyGraphs can be demonstrated in the ‘Track’ curriculum above.

Imagine ‘Track Stage’ manages two rig parameters, P1 and P2,and these rig parameters update independently from one another according to different metrics, in this case, metrics m1 and m2 associated with m1_rule and m2_rule respectively. With parallel execution, the most natural way of implementing this situation is with two tracks as shown, where a mouse can progress asynchronously along each parameter track. If PolicyGraph was limited to serial execution, implementing this use case would be possible but more clumsy. m1_rule and m2_rule would have to be combined into a compound policy transition and the left/right policies would need to be combined into a compound policy with additional conditional logic inside checking if m1_rule or m2_rule was triggered. With parallel execution, Policies and PolicyTransitions simplify into atomic operations.

Writing to PolicyGraph is easy. Similar to Curriculum’s API for adding, removing, and reordering stages, Stage comes with a simple API for adding, removing, and reordering policies. The structure of the high-level graph and the policy graphs can always be seen using export_diagram().

This library has been rigorously tested, and all combinations of StageGraph and PolicyGraph are supported. Here are some more examples of the possibilities. The high-level stage graph are shown to the left and the individual policy graphs are shown to the right. All diagrams have been generated automatically from examples/example_project and examples/example_project_2.

Tree Curriculum

A ‘Tree’ Curriculum

Policy Triangle Curriculum

A ‘Policy Triangle’ Curriculum

Stage Triangle Curriculum

A ‘Stage Triangle’ Curriculum

\(~\)

Understanding the Trainer and TrainerState¶

The Trainer holds the evaluation logic to suggest an update for the curriculum.’ Conceptually, it achieves this by evaluating the current state of the trainer (:py:class:` ~aind_behavior_curriculum.trainer.TrainerState`) and a compatible set of Metrics. Calling the evaluate method will return a new TrainerState with the updated mouse position and associated TaskParameters.

The evalution algorithm follows the following logic:

Trainer takes the outgoing StageTransition with the highest priority. If multiple StageTransition evaluate to True, then the StageTransition with the highest priority is chosen. Priority is set by the user using set_stage_transition_priority().

Trainer takes the outgoing PolicyTransition with the highest priority. If multiple PolicyTransition evaluate to True, then the PolicyTransition with the highest priority is chosen. Priority is set by the user using set_policy_transition_priority().

StageTransition overrides PolicyTransition. If a StageTransition and PolicyTransition both evaluate to True, the Trainer jumps directly to the next Stage.

If no transitions are True, the mouse stays in place.

For multiple active Policy that evaluate to True, Trainer sets the current Task parameters to the net combination of incident Policy.

Understanding the TrainerServer¶

Warning

This feature is still under development and it is quite thin at the moment.

The TrainerServer wraps a Trainer and aims to provide an interface to manage the training of multiple subjects. It currently supports the following features:

1) Registration: This is the entry point where the mice enter the system. Here, the user provides the TrainerServer with a mouse and associates the mouse with a curriculum, a start stage, and start policies as a starting place for evaluation.

Evaluation: For each registered mouse, the TrainerServer looks at the mouse’s current position in its hypergraph curriculum. The wrapped Trainer collects all the current outgoing transitions and checks which evaluate to True. The Trainer determines the updated hypergraph position and associated Task parameters according to the following simple rules:
Mouse Override: This allows the user to update a mouse’s position manually to any position in its curriculum. Future evaluation occurs from this new position. Due to this feature, it is possible to design a Curriculum of ‘floating stages’ and ‘floating policies’.
Mouse Eject: This allows the user to remove a mouse from its curriculum entirely. The position of the mouse is recorded as ‘None’ and stays at ‘None’ on future evaluation unless the mouse is overrides back onto curriculum.

\(~\)

Building a Curriculum¶

For examples of how to build a Curriculum, please reference examples/example_project and examples/example_project_2 within the project files and their associated diagrams, examples/example_project/diagrams and examples/example_project_2/diagrams.

Tips for building your own Curriculum:

Focus on one graph at a time. Define all the Tasks/Stages/Stage Transitions associated with the higher level graph, and then move onto defining the Policies/Policy Transitions/Start Policies associated with each Stage.
aind_behavior_curriculum.curriculum.Metrics contains all the variables that trigger conditions associated with Trainer and PolicyTransition. Progressively add to aind_behavior_curriculum.curriculum.Metrics as needed.
Keep StageTransition and PolicyTransition simple. A typical transition will only trigger on one metric variable. This makes transitions much easier to name.
Validate StageTransition and PolicyTransition priority with the export_diagram() utility, which labels edges with its rank. Use set_stage_transition_priority() and set_policy_transition_priority() to reorder priority.

\(~\)

A word on `Metrics`¶

Metrics used in the curriculum should follow the following general principles:

Metrics should be simple and cheap to calculate. A Metrics should represent a collection of variables that can be used to trigger a StageTransition or PolicyTransition. For example, a metric could be ‘time spent in stage’, ‘distance traveled’, or ‘number of licks’.
Metrics should be calculated as soon as the data is acquired, ideally at the rig.
While the calculation of these metrics will be largely up to the user, we strongly encourage users to maintain a single method that is used to solely return the populated model. This will make it easier to maintain and update the metrics as needed, without incurring in extra dependencies (e.g. plotting libraries, etc.).

Building a Trainer¶

The 4 primary functions of the Trainer described above are decoupled from any database. To use the Trainer in practice, the user must define load_data() and write_data() which connect to a user’s databases for mice curriculum, mice history, and mice metrics. Please see examples/example_project/trainer.py for an example.

\(~\)

Declarative vs Imperative syntax¶

The current version of the library is designed to be able to be used in a declarative or imperative manner.

Under the declarative paradigm, the user defines the Task`(s) and `Curriculum in a function-oriented syntax using:

Under a more imperative paradigm, the user can define the Task and Curriculum objects directly by inherting from these base classes.

For examples, see the ./examples directory.

Inside Allen Institute of Neural Dynamics¶

Allen Institute of Neural Dynamics offers an internal repository template that automatically uploads the repository’s curriculum to a central bucket available here: https://github.com/AllenNeuralDynamics/aind-behavior-curriculum-template This way, curriculums can be accessed across rig computers and reused/modified similar to Github commits.

As of (5/9/2024), a Metrics database has yet to be defined, therefore a Trainer cannot be defined.

Getting started¶

Installation¶

Documentation¶

Understanding a Curriculum¶

Understanding the Trainer and TrainerState¶

Understanding the TrainerServer¶

Building a Curriculum¶

A word on Metrics¶

Building a Trainer¶

Declarative vs Imperative syntax¶

Inside Allen Institute of Neural Dynamics¶

A word on `Metrics`¶