Standout Papers

Dynamic Programming and Markov Processes. 1961 2026 1982 2004 1.9k
  1. Dynamic Programming and Markov Processes. (1961)
    Marshall Freimer, Ronald A. Howard Journal of the American Statistical Association

Citation Impact

Citing Papers

Intensive versus Conventional Glucose Control in Critically Ill Patients
2009 Standout
Control strategies for a stochastic planner
1994
Acting Optimally in Partially Observable Stochastic Domains
1994
Cost-effective sensing during plan execution
1994
Price Expectations and the Phillips Curve
1969 StandoutNobel
Solving very large weakly coupled Markov decision processes
1998
Rewarding behaviors
1996
Econometric Analysis of Stabilization Policies
1969 StandoutNobel
Planning with deadlines in stochastic domains
1993
Reinforcement Learning with Factored States and Actions
2004 StandoutNobel
Monte Carlo Matrix Inversion and Reinforcement Learning
1993
A Theory and Test of Credit Rationing
1969 StandoutNobel
Neural Mechanisms of Hierarchical Planning in a Virtual Subway Network
2016 StandoutNobel
Tracking the Emergence of Conceptual Knowledge during Human Decision Making
2009 StandoutNobel
Mastering the game of Go without human knowledge
2017 StandoutNatureNobel
Human-level control through deep reinforcement learning
2015 StandoutNatureNobel
Continuous-Time Adaptive Critics
2007
Deep learning in neural networks: An overview
2014 Standout
Multiobjective dynamic programing with application to a reservoir
1979
Model-free Q-learning designs for linear discrete-time zero-sum games with application to H-infinity control
2007
Hierarchical Reinforcement Learning with the MAXQ Value Function Decomposition
2000
Alternative Approaches to Analyzing Markets with Asymmetric Information: Reply
1983 StandoutNobel
A Neural Substrate of Prediction and Reward
1997 StandoutScience
Independence of irrelevant alternatives, and solutions to Nash's bargaining problem
1977 StandoutNobel
Modified Policy Iteration Algorithms for Discounted Markov Decision Problems
1978
On Sequential Decisions and Markov Chains
1962 Standout
A Probabilistic Production and Inventory Problem
1963
Planning under time constraints in stochastic domains
1995
The Pricing of Options and Corporate Liabilities
1973 StandoutNobel
An object-oriented representation for efficient reinforcement learning
2008
Reservoir Management and Operations Models: A State‐of‐the‐Art Review
1985 Standout
The Fair Wage-Effort Hypothesis and Unemployment
1990 StandoutNobel
Multilevel incremental dynamic programing
1976
Toward a Theory of Discounted Repeated Games with Imperfect Monitoring
1990
Dividend Policy: An Empirical Analysis
1968 StandoutNobel
IMPLEMENTATION OF AN OPTIMIZATION MODEL FOR OPERATION OF A METROPOLITAN RESERVOIR SYSTEM1
1977
Classification and Regression Trees.
1984 Standout
Determinants of corporate borrowing
1977 Standout
Improved dynamic programing procedures and their practical application to water resource systems
1974
Optimizing decision trees through heuristically guided search
1978
On the Faustian Dynamics of Policy and Political Power
2011
Incentive Effects of Terminations: Applications to the Credit and Labor Markets
1983 StandoutNobel
Optimal control of Markov processes with incomplete state information
1965
Dynamic programming applications in water resources
1982
Reforming the Global Economic Architecture: Lessons from Recent Crises
1999 StandoutNobel
A dynamic programming successive approximations technique with convergence proofs
1970
A MEAN‐VARIANCE THEORY OF OPTIMAL CAPITAL STRUCTURE AND CORPORATE DEBT CAPACITY
1978
Technical Note: Q-Learning
1992 Standout
Human-level performance in 3D multiplayer games with population-based reinforcement learning
2019 StandoutScienceNobel
Pareto Optimality and Competition
1981 StandoutNobel
Saving and Liquidity Constraints
1991 StandoutNobel
An income fluctuation problem
1976
The complexity of stochastic games
1992
On Nonterminating Stochastic Games
1966
Dynamic Stability and Reform of Political Institutions
2005
Numerical maximum log likelihood estimation for generalized lambda distributions
2006
Persistence of Power, Elites, and Institutions
2008 StandoutNobel
The Folk Theorem with Imperfect Public Information
1994 StandoutNobel
Learning Finite-State Controllers for Partially Observable Environments
2013
DEBT AND TAXES*
1977 StandoutNobel
Reputation Acquisition in Debt Markets
1989 StandoutNobel
On the Theory of Infinitely Repeated Games with Discounting
1988
Distributed Lags: A Survey
1967
Cycles of Conflict: An Economic Model
2014 StandoutNobel
Optimal cartel equilibria with imperfect monitoring
1986
A STOCHASTIC DYNAMIC PROGRAMMING MODEL FOR THE OPTIMUM OPERATION OF A MULTI‐PURPOSE RESERVOIR1
1973
The convergence of TD(?) for general ?
1992
The Optimal Control of Partially Observable Markov Processes over a Finite Horizon
1973
Adaptive optimal control for continuous-time linear systems based on policy iteration
2008 Standout
Reinforcement Learning: A Survey
1996 Standout
Online actor–critic algorithm to solve the continuous-time infinite horizon optimal control problem
2010 Standout
Game-theoretic models and the role of information in bargaining.
1979 StandoutNobel
The Contributions of the Economics of Information to Twentieth Century Economics
2000 StandoutNobel
Planning and acting in partially observable stochastic domains
1998 Standout
Corporate financing and investment decisions when firms have information that investors do not have
1984 Standout
Average reward reinforcement learning: Foundations, algorithms, and empirical results
1996
Using Expectation-Maximization for Reinforcement Learning
1997 StandoutNobel
Convergence rate analysis of the state increment dynamic programming method
1983
Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning
1999 Standout
Galerkin approximations of the generalized Hamilton-Jacobi-Bellman equation
1997
Reliability‐constrained reservoir control problems: 1. Methodological issues
1979
A MODEL OF WARRANT PRICING IN A DYNAMIC MARKET
1970
Stochastic dynamic programming with factored representations
2000
Rules Rather than Discretion: The Inconsistency of Optimal Plans
1977 StandoutNobel
Contraction Mappings in the Theory Underlying Dynamic Programming
1967
Externalities in Economies with Imperfect Information and Incomplete Markets
1986 StandoutNobel
A Partitioning Algorithm with Application in Pattern Classification and the Optimization of Decision Trees
1973
Sharecropping and the Interlinking of Agrarian Markets
1982 StandoutNobel
Asset Prices in an Exchange Economy
1978 StandoutNobel
Debt Maturity Structure and Liquidity Risk
1991 StandoutNobel
Evolution and Intelligent Design
2008 StandoutNobel
Optimal long‐term control of a multipurpose reservoir with indirect users
1976
The Consequences of the Dependence of Quality on Price
1987 StandoutNobel
Credit Rationing in Markets with Imperfect Information
1981 StandoutNobel
Markets, Market Failures, and Development
1989 StandoutNobel
Revenue Management Under a General Discrete Choice Model of Consumer Behavior
2004 Standout
Generalization of White's Method of Successive Approximations to Periodic Markovian Decision Processes
1972
Credit Rationing: Reply
1987 StandoutNobel
Control Techniques for Complex Networks
2007
Solving H-horizon, stationary Markov decision problems in time proportional to log(H)
1990

Works of Marshall Freimer being referenced

Why Bankers Ration Credit
1965
Dynamic Programming and Markov Processes.
1961 Standout
Adaptive Control Processes: A Guided Tour.
1965
Some New Results on Compromise Solutions for Group Decision Problems
1976
Applied Dynamic Programming.
1964
a study of the generalized tukey lambda family
1988
Rankless by CCL
2026