All four teams from the UT Programming Team finished top six, including taking first place, in the South Central USA Regional ...
This groundbreaking contribution is a testament to Miikkulainen’s lasting impact on the field of neural networks, a legacy ...
We consider the task of evaluating a policy for a Markov decision process (MDP).The standard unbiased technique for evaluating a policy is to deploy the policyand observe its performance. We show that ...
Our students and faculty are changing the world through their contributions to computing education, research, and industry. These awards received by members of the UT Computer Science community make ...
Artificial Intelligence and Life in 2030. Peter Stone, Rodney Brooks, Erik Brynjolfsson, Ryan Calo, Oren Etzioni, Greg Hager, Julia Hirschberg, Shivaram ...
Patrick MacAlpine and Peter Stone.
Though computers have surpassed humans at many tasks, especially computationally intensive ones, there are many tasks for which human expertise remains necessary and/or useful. For such tasks, it is ...
Transfer Learning for Reinforcement Learning Domains: A Survey. Matthew E. Taylor and Peter Stone. Journal of Machine Learning Research, 10(1):1633–1685, 2009.
Multiagent Traffic Management: A Reservation-Based Intersection Control Mechanism. Kurt Dresner and Peter Stone. In The Third International Joint Conference on Autonomous Agents and Multiagent Systems ...
Imitation from observation (IfO) is the problem of learning directly from state-only demonstrations without having access to the demonstrator's actions.The lack of action information both ...
TEXPLORE: Real-Time Sample-Efficient Reinforcement Learning for Robots. Todd Hester and Peter Stone. Machine Learning, 90(3):385–429, 2013.
RoboCup-2012: Robot Soccer World Cup XVI, Xiaoping Chen, Peter Stone, Luis Enrique Sucar, and Tijn van der Zant, editors. Lecture Notes in Artificial Intelligence, Springer Verlag, Berlin, 2013.