ÿþ<html> <head> <TITLE>Philip S. Thomas -- Publications</TITLE> <link rel="shortcut icon" href="favicon.ico"> </head> <body> <CENTER> <h1>Technical Publications of Philip S. Thomas</h1><p> <h3>(in Reverse Chronological Order)</h3> </CENTER> <h2>Papers:</h2> <ol> <li> Philip S. Thomas. <a href="Data/pub/NIPS2011.pdf">Policy Gradient Coagent Networks</a>. To appear, <I>The Twenty-Fifth Annual Conference on Neural Information Processing Systems</I>, December 2011. <br> <li> George D. Konidaris, Scott D. Niekum, and Philip S. Thomas. <a href="Data/pub/NIPS2011b.pdf">TD³: Re-evaluating Complex Backups in Temporal Difference Learning.</a>. To appear, <I>The Twenty-Fifth Annual Conference on Neural Information Processing Systems</I>, December 2011. <br> <li> George D. Konidaris, Sarah Osentoski, and Philip S. Thomas. <a href="Data/pub/AAAI2011.pdf">Value function approximation in reinforcement learning using the Fourier basis</a>. To appear, <I>Proceedings of the Twenty-Fifth Conference on Artificial Intelligence</I>, August 2011. <br> <li> Philip S. Thomas and Andrew G. Barto. <a href="Data/pub/Thomas_CoMDPs_ICML2011.pdf">Conjugate Markov decision processes</a>. <I>Proceedings of the Twenty-Eighth International Conference on Machine Learning</I>, June 2011.<br> <a href="Data/pub/Thomas_CoMDPs_ICML2011_source.zip">Source code</a> <br> <li> Philip S. Thomas, <I><a href="Data/pub/MSThesis.pdf">A reinforcement learning controller for functional electrical stimulation of a human arm</a></I>, Master's thesis, Department of Electrical Engineering and Computer Science, Case Western Reserve University, Cleveland, OH, August 2009.<br> Adviser: Professor Michael S. Branicky <br><a href="Data/pub/MSThesisPresentation.pdf">Slide Presentation</a> <li> Philip S. Thomas, Michael S. Branicky, Antonie van den Bogert, Kathleen M. Jagodnik. <a href="Data/pub/iaai09.pdf"> Application of the actor-critic architecture to functional electrical stimulation control of a human arm.</a> <I>Proceedings of the Twenty-First Innovative Applications of Artificial Intelligence</I>, pages 165-172, Pasadena, CA, 14-16 July 2009. <br><a href="Data/pub/iaai09ppt.pdf">Presentation</a>, some images removed. <br> <li> Philip S. Thomas, Michael Branicky, Antonie van den Bogert, Kathleen Jagodnik. <a href="Data/pub/wals08.pdf"> Creating a reinforcement learning controller for functional electrical stimulation of a human arm.</a> <I>Proceedings of the Fourteenth Yale Workshop on Adaptive and Learning Systems</I>, pages 15-20, New Haven, CT, 2-4 June 2008. (Postprint with two minor corrections). <br><a href="Data/pub/wals08pres.pdf">Slide Presentation, videos removed</a> <li> Wyatt Newman, et. al. <a href="Data/pub/TeamCaseTechPaper07.pdf"> Team Case and the 2007 DARPA Urban Challenge</a> Technical Paper submitted to DARPA, 1 June 2007. <li> Alex Kandabarow, Mark Rafalko, Philip Thomas. <a href="Data/pub/Penguin.pdf">Penguins with hats, penguins with pants.</a> <I>7th Grade English with Mrs. Haiges</I>, Sewickley Academy, PA, c. 1998. </ol> <h2>Posters:</h2> <ol> <li> Kathleen Jagodnik, Philip Thomas, Michael Branicky, Antonie van den Bogert. Reinforcement Learning Controller for Planar Human Arm Movement using Functional Electrical Stimulation (FES). <I>Reseach ShowCase</I>. Case Western Reserve University, Cleveland, OH, 15 April 2010.<br> <a href="Data/pub/ShowCase10poster.pdf">Poster.</a> <li> Philip S. Thomas, Antonie van den Bogert, Kathleen Jagodnik, and Michael S. Branicky. Achieving long-term stability using a reinforcement learning controller for functional electrical stimulation control of a human arm. <I>Research ShowCase</I>. Case Western Reserve University, Cleveland, OH, 16 April 2009.<br> <a href="Data/pub/ShowCase09poster.pdf"> Poster.</a> <li> Kathleen Jagodnik, Antonie van den Bogert, Michael Branicky, and Philip Thomas. A Proportional Derivative Controller for Planar Arm Movement. <I>North American Congress on Biomechanics </I> (NACOB), Ann Arbor, Michigan, 5-9 August 2008.<br> <a href="Data/pub/NACOB08poster.pdf">Poster.</a> <li> Philip S. Thomas, Michael S. Branicky, Antonie van den Bogert, and Kathleen Jagodnik. FES Control of a Human Arm Using Reinforcement Learning. <I>Adaptive Movement in Animals and Machines </I> (AMAM), Cleveland, OH, 1-6 June 2008.<br> <a href="Data/pub/amam08.pdf"> Extended Abstract.</a> <li> Philip Thomas, Wyatt Newman. Developing the Vehicle-Behavior Interface Layer and Obstacle Field Mood for the 2007 DARPA Urban Challenge. <I>SOURCE Intersections</I>. Case Western Reserve University, Cleveland, OH, 18 April 2008.<br> <a href="Data/pub/SOURCE08.pdf">Poster.</a> <li> Philip Thomas, Antonie van den Bogert, Kathleen Jagodnik, and Michael Branicky. Creating a reinforcement learning controller for functional electrical stimulation of a human arm. <I>Research ShowCase</I>. Case Western Reserve University, Cleveland, OH, 17 April 2008.<br> <a href="Data/pub/ShowCase08poster.pdf"> Poster.</a> <li> Wyatt Newman, Roger D. Quinn, Michael Branicky, Frank Merat, et. al. Team Case and the 2007 DARPA Urban Challenge. <I>Research ShowCase</I>. Case Western Reserve University, Cleveland, OH, 17 April 2008.<br> <a href="Data/pub/ShowCaseUC08poster.pdf">Poster.</a> </ol> <hr> <a href="http://psthomas.com">Take me home</a><br> Created: <em>6-24-2009</em>. Last Modified: <em>7-21-2009</em>. &copy; Philip S. Thomas</a> <!-- Start of StatCounter Code --> <script type="text/javascript"> var sc_project=6027876; var sc_invisible=1; var sc_security="53ae9f4d"; </script> <script type="text/javascript" src="http://www.statcounter.com/counter/counter.js"></script><noscript><div class="statcounter"><a title="visit tracker on tumblr" href="http://www.statcounter.com/tumblr/" target="_blank"><img class="statcounter" src="http://c.statcounter.com/6027876/0/53ae9f4d/1/" alt="visit tracker on tumblr" ></a></div></noscript> <!-- End of StatCounter Code --> </body> </html>