PREPRINTS
Plan online, learn offline: Efficient learning and exploration via model-based control
Lowrey K, Rajeswaran A, Kakade S, Todorov E and Mordatch I
Learning dexterous manipulation policies from experience and imitation
Kumar V, Gputa A, Todorov E and Levine S
Movie
Graphical Newton
Srinivasan A and Todorov E
PHD THESES
Manipulators and manipulation in high-dimensional spaces
Vikash Kumar (2016). University of Washington
New techniques in deep representation learning
Galen Andrew (2016). University of Washington
Automated discovery and learning of complex movement behaviors
Igor Mordatch (2015). University of Washington
Design and control of an anthropomorphic robotic hand: Learning advantages from
the human body and brain
Zhe (Joseph) Xu (2015). University of Washington
Automating stochastic control
Krishnamurthy Dvijotham (2014). University of Washington
Value-function approximation methods for linearly-solvable Markov decision processes
Minguyan Zhong (2013). University of Washington
Theory and implementation of bio-mimetic motor controllers
Yuval Tassa (2011). Hebrew University of Jerusalem
Exploratory studies of human sensorimotor learning with system identification and stochastic optimal control
Alex Simpkins (2009). University of California San Diego
Computational and psychophysical studies of goal-directed arm movements
Dan Liu (2008). University of California San Diego
Optimal control for biological movement ystems
Weiwei Li (2006). University of California San Diego
Studies of goal-directed movements
Emanuel Todorov (1998). Massachusetts Institute of Technology
PEER-REVIEWED PUBLICATIONS BY YEAR
- 2018 -
Reinforcement learning for non-prehensile manipulation:
Transfer from simulation to physical system
Lowrey K, Kolev S, Dao J, Rajeswaran A and Todorov E. In IEEE International Conference on Simulation, Modeling and Programming for Autonomous Robots (SIMPAR) 2018.
Best Paper Award
Movie
Learning complex dexterous manipulation with deep reinforcement
learning and demonstrations
Rajeswaran A, Kumar V, Gupta A, Schulman J, Todorov E and Levine S. In Robotics Science and Systems (RSS) 2018.
Project
Goal directed dynamics
Todorov E. In IEEE International Conference on Robotics and Automation (ICRA) 2018.
Movie
- 2017 -
Towards generalization and simplicity in continuous control
Rajeswaran A, Lowrey K, Todorov E and Kakade S. In Neural Information Processing Systems (NIPS) 2017.
Project
- 2016 -
Realtime state estimation with whole-body multi-contact dynamics: A modified UKF approach
Lowrey K, Dao J and Todorov E. In IEEE/RAS International Conference on Humanoid Robots
Movie
Optimal control with learned local models: Application to dexterous manipulation
Kumar V, Todorov E and Levine S. In IEEE International Conference on Robotics and Automation 2016.
Best Manipulation Paper Award
Movie
Design of a highly biomimetic anthropomorphic robotic hand: Towards artificial limb regeneration
Xu Z and Todorov E. IEEE International Conference on Robotics and Automation 2016.
Movie
- 2015 -
Interactive control of diverse complex characters with neural networks
Mordatch I, Lowrey K, Andrew G, Popovic Z and Todorov E (2015). In Neural Information Processing Systems.
Movie
Physically consistent state estimation and system identification for contacts
Kolev S and Todorov E (2015). In IEEE/RAS International Conference on Humanoid Robots.
Movie
MuJoCo HAPTIX: A virtual reality system for hand manipulation
Kumar V and Todorov E (2015). In IEEE/RAS International Conference on Humanoid Robots.
Movie
Whole-body model-predictive control applied to the HRP-2 humanoid robot
Koenemann J, Del Prete A, Tassa Y, Todorov E, Stasse O, Bennewitz M and Mansard N (2015). In IEEE/RAS International Conference on Intelligent Robots and Systems
Movie
Ensemble-CIO: Full-body dynamic motion planning that transfers to physical humanoids
Mordatch I, Lowrey K and Todorov E (2015). In IEEE/RAS International Conference on Intelligent Robots and Systems
Movie
Simulation tools for model-based robotics: Comparison of Bullet, Havok, MuJoCo, ODE and PhysX
Erez T, Tassa Y and Todorov E (2015). In International Conference on Robotics and Automation
Movie
Convex structured controller design in finite horizon
Dvijotham K, Todorov E and Fazel M (2015). IEEE Transactions on Control of Network Systems, vol 2, issue 1
- 2014 -
Convex risk-averse control design
Dvijotham K, Todorov E and Fazel M (2015). In IEEE Conference on Decision and Control
Universal convexification via risk-aversion
Dvijotham K, Fazel M and Todorov E (2014). In Uncertainty in Artificial Intelligence
Facebook Best Student Paper
Combining the benefits of function approximation and trajectory optimization
Mordatch I and Todorov E (2014). In Robotics: Science and Systems
Movie
Physically-consistent sensor fusion in contact-rich behaviors
Lowrey K, Kolev S, Tassa Y, Erez T and Todorov E (2014). In IEEE/RAS International Conference on Intelligent Robots and Systems
Movie
From inverse kinematics to optimal control
Geoffroy P, Mansard N, Raison M, Achiche S, Tassa Y and Todorov E (2014). In Advances in Robot Kinematics 2014
Convex and analytically-invertible dynamics with contacts and constraints: Theory and implementation in MuJoCo
Todorov E (2014). In International Conference on Robotics and Automation
Movie
Control-limited Differential Dynamic Programming
Tassa Y, Mansard N and Todorov E (2014). In International Conference on Robotics and Automation
Movie
Real-time behaviour synthesis for dynamic hand manipulation
Kumar V, Tassa Y, Erez T and Todorov E (2014). In International Conference on Robotics and Automation
Movie
Design, optimization, calibration and a case study of a 3D-printed,
low-cost fingertip sensor for robotic manipulation
Xu Z, Kolev S and Todorov E (2014). In International Conference on Robotics and Automation
Movie
- 2013 -
Animating human lower limbs using Contact-Invariant Optimization
Mordatch I, Wang J, Todorov E and Koltun V (2013). In SIGGRAPH ASIA
Movie
Fast, strong and compliant pneumatic actuation for
dexterous tendon-driven hands
Kumar V, Xu Z and Todorov E (2013). In International Conference on Robotics and Automation
Movie
STAC: Simultaneous tracking and calibration
Wu T, Tassa Y, Kumar V, Movellan J and Todorov E (2013). In IEEE/RAS International Conference on Humanoid Robots
Movie
A low-cost and modular, 20-DOF anthropomorphic robotic hand: Design, actuation and modeling
Xu Z, Kumar V and Todorov E (2013). In IEEE/RAS International Conference on Humanoid Robots
Movie
An integrated system for real-time model-predictive control of humanoid robots
Erez T, Lowrey K, Tassa Y, Kumar V, Kolev S and Todorov E (2013). In IEEE/RAS International Conference on Humanoid Robots
Movie
Convex control design via covariance minimization
Dvijotham K, Todorov E and Fazel M (2013). In 51st Annual Allerton Conference on Communication, Control, and Computing
Convexity of optimal linear controller design
Dvijotham K, Theodorou E, Todorov E and Fazel M (2013). In IEEE Conference on Decision and Control
Time-varying nonlinear policy gradients
Theodorou E, Dvijotham K, and Todorov E (2013). In IEEE Conference on Decision and Control
Modeling and identification of pneumatic actuators
Tassa Y, Wu T, Movellan J and Todorov E (2013). In IEEE International Conference on Mechatronics and Automation
Best paper finalist
From information-theoretic dualities to path-integral and Kullback Leibler control: Continuous and discrete-time formulationis
Theodorou E, Dvijotham K and Todorov E (2013). In 16th Yale Workshop on Learning and Adaptive Systems
Multi-robot active SLAM with relative entropy optimization
Kontitsis M, Theodorou E and Todorov E (2013). In American Control Conference
The delta-sensitivity and its application to stochastic optimal control of nonlinear diffusions
Theodorou E and Todorov E (2013). In American Control Conference
Free energy based policy gradients
Theodorou E, Najemnik J and Todorov E (2013). In IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning
Value function approximation and model-predictive control
Zhong M, Johnson M, Tassa Y, Erez T and Todorov E (2013). In IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning
- 2012 -
Linearly-solvable Markov games
Dvijotham K and Todorov E (2012). In American Control Conference
Information theoretic views on path-integral control
Theodorou E and Todorov E (2012). In NIPS Workshop on Information of Action and Perception
Reduced dimensionality control for the ACT hand
Malhotra M, Rombokas E, Theodorou E, Todorov E and Matsuoka Y (2012). In International Conference on Robotics and Automation
Tendon-driven control of biomechanical and robotic systems: A path-integral reinforcement learning approach
Rombokas E, Theodorou E, Malhotra M, Todorov E and Matsuoka Y (2012). In International Conference on Robotics and Automation
MuJoCo: A physics engine for model-based control
Todorov E, Erez T and Tassa Y (2012). In IEEE/RSJ International Conference on Intelligent Robots and Systems
Design of an anthropomorphic robotic finger system with biomimetic artificial joints
Xu Z, Kumar V, Matsuoka Y and Todorov E (2012). In IEEE Biomedical Robotics and Biomechatronics
Synthesis and stabilization of complex behaviors through online trajectory optimization
Tassa Y, Erez T and Todorov E (2012). In IEEE/RSJ International Conference on Intelligent Robots and Systems
Movie
Trajectory optimization for domains with contacts using inverse dynamics
Erez T and Todorov E (2012). In IEEE/RSJ International Conference on Intelligent Robots and Systems
Stochastic optimal control for nonlinear Markov jump diffusion processes
Theodorou E and Todorov E (2012). In American Control Conference
Relative entropy and free energy dualities: Connections to path integral and KL control
Theodorou E and Todorov E (2012). In IEEE Conference on Decision and Control
Contact-invariant optimization for hand manipulation
Mordatch I, Popovic, Z and Todorov E(2012). In Eurographics / ACM SIGGRAPH Symposium on Computer Animation
Project Page
Discovery of complex behaviors through contact-invariant optimization
Mordatch I, Todorov E and Popovic, Z (2012). In ACM SIGGRAPH
Project Page
Linearly-solvable optimal control
Dvijotham K and Todorov E (2012). In Reinforcement Learning and Approximate Dynamic Programming for Feedback Control, Lewis (ed), chap. 6, Wiley and IEEE Press
- 2011 -
Optimal limit-cycle control recast as Bayesian inference
Tassa Y, Erez T and Todorov E (2011). In World Congress of the International Federation of Automatic Control
Aggregation methods for linearly-solvable MDPs
Zhong M and Todorov E (2011). In World Congress of the International Federation of Automatic Control
Finding the most likely trajectories of optimally-controlled stochastic systems
Todorov E (2011). In World Congress of the International Federation of Automatic Control
Inverse optimality design for biological movement systems
Li W, Todorov E and Liu D (2011). In World Congress of the International Federation of Automatic Control
A unifying framework for linearly-solvable control
Dvijotham K and Todorov E (2011). In Uncertainty in Artificial Intelligence
Infinite-horizon model predictive control for nonlinear periodic tasks with contacts
Erez T, Tassa Y and Todorov E (2011). In Robotics: Science and Systems
Neuromuscular stochastic optimal control of a tendon-driven index finger model
Theodorou E, Todorov E and Valero-Cuevas F (2011). In American Control Conference
Design and analysis of an artificial finger joint for anthropomorphic robotic hands
Xu Z, Todorov E, Dellon B, Matsuoka Y (2011). In International Conference on Robotics and Automation
Modular bio-mimetic robots that can interact with the world the way we do
Simpkins A, Kelley M and Todorov E (2011). In International Conference on Robotics and Automation
A convex, smooth and invertible contact model for trajectory optimization
Todorov E (2011). In International Conference on Robotics and Automation
Movie
First-exit model predictive control of fast discontinuous dynamics: Application to ball bouncing
Kulchenko P and Todorov E (2011). In International Conference on Robotics and Automation
Movie
Complex object manipulation with hierarchical optimal control
Simpkins A and Todorov E (2011). In IEEE Adaptive Dynamic Programming and Reinforcement Learning
Policy gradient methods with model predictive control applied to ball bouncing
Kulchenko P and Todorov E (2011). In IEEE Adaptive Dynamic Programming and Reinforcement Learning
Moving least-squares approximations for linearly-solvable stochastic optimal control problems
Zhong M and Todorov E (2011). J Control Theory Appl, 9(3): 451-463
Moving least-squares approximations for linearly-solvable optimal control problems
Zhong M and Todorov E (2011). In IEEE Adaptive Dynamic Programming and Reinforcement Learning
High-order local dynamic programming
Tassa Y and Todorov E (2011). In IEEE Adaptive Dynamic Programming and Reinforcement Learning
Movie
- 2010 -
Policy gradients in linearly-solvable MDPs
Todorov E (2010). In Advances in Neural Information Processing Systems 24
Inverse optimal control with linearly-solvable MDPs
Dvijotham K and Todorov E (2010). In International Conference on Machine Learning
Identification and control of a pneumatic robot
Todorov E, Hu C, Simpkins A and Movellan J (2010). In IEEE Biomedical Robotics and Biomechatronics
Movie1;
Movie2;
Movie3
Stochastic complementarity for local control of discontinuous dynamics
Tassa Y and Todorov E (2010). In Robotics: Science and Systems
Movie
Position estimation and control of compact BLDC motors based on analog linear Hall effect sensors
Simpkins A and Todorov E (2010). In American Control Conference
Stochastic differential dynamic programming
Theodorou E, Tassa Y and Todorov E (2010). In American Control Conference
Implicit nonlinear complementarity: A new approach to contact dynamics
Todorov E (2010). In International Conference on Robotics and Automation
Movie
A first optimal control solution for a complex, nonlinear, tendon driven neuromuscular finger model
Theodorou E, Todorov E and Valero-Cuevas F (2010). In ASME Summer Bioengineering Conference
- 2009 -
Compositionality of optimal control laws
Todorov E (2009). In Advances in Neural Information Processing Systems 22, pp 1856-1864, Bengio et al (eds), MIT Press
Efficient computation of optimal actions
Todorov E (2009). PNAS, 106(28): 11478-11483
Commentary;
Supplementary information
Structured variability of muscle activations supports the minimal intervention principle of motor control
Valero-Cuevas F, Venkadesan M and Todorov E (2009). Journal of Neurophysiology,
102: 59-68
Journal cover
Eigenfunction approximation methods for linearly-solvable optimal control problems
Todorov E (2009). In proceedings of the 2nd IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, pp 161 - 168
Hierarchical optimal control of a 7-DOF arm model
Liu D and Todorov E (2009). In proceedings of the 2nd IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, pp 50 - 57
Practical numerical methods for stochastic optimal control of biological systems in continuous time and space
Simpkins A and Todorov E (2009). In proceedings of the 2nd IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, pp 212 - 218
Real-time motor control using recurrent neural networks
Huh D and Todorov E (2009). In proceedings of the 2nd IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning, pp 42 - 49
Iterative local dynamic programming
Todorov E and Tassa Y (2009). In proceedings of the 2nd IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning), pp 90 - 95
- 2008 -
General duality between optimal control and estimation
Todorov E (2008). In proceedings of the 47th IEEE Conference on Decision and Control, pp 4286 - 4292
Optimal trade-off between exploration and exploitation
Simpkins A, de Callafon R and Todorov E (2008). In proceedings of the American Control Conference, pp 33-38
Parallels between sensory and motor information processing
Todorov E (2008). In The Cognitive Neurosciences, 4th ed, Gazzaniga (ed), MIT Press
- 2007 -
Predicting reaching targets from human EEG
Hammon P, Makeig S, Poizner H, Todorov E and de Sa V (2007). IEEE Signal Processing Magazine, 25: 69-77
Evidence for the flexible sensorimotor strategies predicted by optimal feedback control
Liu D and Todorov E (2007). Journal of Neuroscience, 27: 9354-9368
Iterative linearization methods for approximately optimal control and estimation of non-linear stochastic systems
Li W and Todorov E (2007). International Journal of Control, 80: 1439-1453
Probabilistic inference of multi-joint movements, skeletal parameters and marker attachments from diverse sensor data
Todorov E (2007). IEEE Transactions on Biomedical Engineering, 54: 1927-1939
State estimation with finite signals-to-noise models via linear matrix inequalities
Li W, Skelton R and Todorov E (2007). Journal of Dynamic Systems, Measurement and Control, 129: 136-143
- 2006 -
Linearly-solvable Markov decision problems
Todorov E (2006). In Advances in Neural Information Processing Systems 19: 1369-1376, Scholkopf et al (eds), MIT Press
Iterative optimal control and estimation design for nonlinear stochastic systems
Li W and Todorov E (2006). In proceedings of the 45th IEEE Conference on Decision and Control, pp 3242-3247
Imitiation learning for reaching and grasping in virtual environments
Singh N and Todorov E (2006). In proceedings of the 5th International Conference on Development and Learning
Optimal control theory
Todorov E (2006). In Bayesian Brain: Probabilistic Approaches to Neural Coding, Doya K at al (eds), chap 12, pp 269-298, MIT Press
- 2005 -
From task parameters to motor synergies: A hierarchical framework for approximately-optimal control of redundant manipulators
Todorov E, Li W and Pan X (2005). Journal of Robotic Systems, 22(11):691-710
Towards an integrated systems for estimating multi-joint movement from diverse sensor data
Pan X, Todorov E and Li W (2005). In proceedings of the 27th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, pp 4982-4985
Hierarchical feedback and learning for multi-joint arm movement control
Li W, Todorov E and Pan X (2005). In proceedings of the 27th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, pp 4400-4403
A generalized iterative LQG method for locally-optimal feedback control of constrained nonlinear stochastic systems
Todorov E and Li W (2005). In proceedings of the American Control Conference, pp 300-306
MATLAB code
Estimation and control of systems with multiplicative noise via linear matrix inequalities
Li W, Skelton R and Todorov E (2005). In proceedings of the American Control Conference, pp 1811-1816
Stochastic optimal control and estimation methods adapted to the noise characteristics of the sensorimotor system
Todorov E (2005). Neural Computation, 17(5): 1084-1108
MATLAB code
- 2004 -
Hierarchical optimal control of redundant biomechanical systems
Li W, Todorov E and Pan X (2004). In proceedings of the 26th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, pp 4618-4621
Development of clinician-friendly software for musculoskeletal modeling and control
Davoodi R, Urata C, Todorov E and Loeb G (2004). In proceedings of the 26th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, pp 4622-4625
Analysis of the synergies underlying complex hand manipulation
Todorov E and Ghahramani Z (2004). In proceedings of the 26th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, pp 4637-4640
Iterative linear-quadratic regulator design for nonlinear biological movement systems
Li W and Todorov E (2004). In proceedings of the 1st International Conference on Informatics in Control, Automation and Robotics, vol 1, pp 222-229
Optimality principles in sensorimotor control
Todorov E (2004). Nature Neuroscience 7(9): 907-915
- 2003 -
Optimal control methods suitable for biomechanical systems
Todorov E and Li W (2003). In proceedings of the 25th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, pp
1758-1761
Unsupervised learning of sensory-motor primitives
Todorov E and Ghahramani Z (2003). In proceedings of the 25th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, pp 1750-1753
A minimal intervention principle for coordinated movement
Todorov E and Jordan M (2003). In Advances in Neural Information Processing Systems 15: 27-34, Becker et al (eds), MIT Press
On the role of primary motor cortex in arm movement control
Todorov E (2003). In Progress in Motor Control III, ch 6, pp 125-166, Latash and Levin (eds), Human Kinetics
- 2002 -
Optimal feedback control as a theory of motor coordination
Todorov E and Jordan M (2002). Nature Neuroscience 5(11): 1226-1235
News and views;
Neuroscience news;
Supplementary information
A biomechanical model of the partially paralyzed human arm
Davoodi R, Brown I, Todorov E and Loeb G (2002). In proceedings of the 7th Annual Conference of the International Functional Electric Stimulation Society
Cosine tuning minimizes motor errors
Todorov E (2002). Neural Computation 14(6): 1233-1260
Use of virtual environments in motor learning and rehabilitation
Holden M and Todorov E (2002). In Handbook of Virtual Environments, ch 49, pp 999-1026, Stanney K (ed), Lawrence Erlbaum Associates
- 2000 and earlier -
One motor cortex, two different views
Todorov E, debate with Georgopoulos A, Ashe J, Moran D, Schwartz A and Scott S (2000). Nature Neuroscience 3(10): 963-965
Direct cortical control of muscle activation in voluntary arm movements: a model
Todorov E (2000). Nature Neuroscience 3(4): 391-398
News and views
Virtual environment training improves motor performance in two patients with stroke
Holden M, Todorov E, Callahan J and Bizzi E (1999). Neurology Report 23(2): 57-67
Smoothness maximization along a predefined path accurately predicts the speed profiles of complex arm movements
Todorov E and Jordan M (1998). Journal of Neurophysiology 80(2): 696-714
A local circuit approach to understanding integration of long-range inputs in primary visual cortex
Somers D, Todorov E et al (1998). Cerebral Cortex 8(3): 204-211
Augmented feedback presented in a virtual environment accelerates learning of a difficult motor task
Todorov E, Shadmehr R and Bizzi E (1997). Journal of Motor Behavior 29(2): 147-158
Modeling visual cortical contrast adaptation effects
Todorov E, Siapas A, Somers D and Nelson S (1997). In Computational Neuroscience: Trends in Reseach 5: 525-531, Bower (ed), Kluwer Academic
A local circuit integration approach to understanding visual cortical receptive fields
Somers D, Todorov E and Siapas A (1997). In Computational Neuroscience: Trends in Reseach 5: 505-510, Bower (ed), Kluwer Academic
A model of recurrent interactions in primary visual cortex
Todorov E, Siapas A and Somers D (1997). In Advances in Neural Information Processing Systems 9: 118-126, Mozer, Jordan, Petsche (eds), MIT Press
Variable gain control in local cortical circuitry supports context-dependent modulation by long-range connections
Somers D, Toth L, Todorov E et al (1996). In Lateral Interactions in Cortex - Structure and Function, ch 4, Sirosh et al (eds), Online Book
Catastrophic interference in human motor learning
Brashers-Krug T, Shadmehr R and Todorov E (1995). In Advances in Neural Information Processing Systems 7: 19-26, Tesauro, Touretzky, Leen (eds), MIT Press
Factorial learning by clustering features
Tenenbaum J and Todorov E (1995). In Advances in Neural Information Processing Systems 7: 561-568, Tesauro, Touretzky, Leen (eds), MIT Press
|