Marshall Freimer

41 papers · 4.6k indexed citations

Standout Papers

Dynamic Programming and Markov Processes. (1961)
Marshall Freimer, Ronald A. Howard · Journal of the American Statistical Association

Citation Impact

Citing Papers

Intensive versus Conventional Glucose Control in Critically Ill Patients

2009 Standout

Control strategies for a stochastic planner

1994

Acting Optimally in Partially Observable Stochastic Domains

1994

Cost-effective sensing during plan execution

1994

Price Expectations and the Phillips Curve

1969 Standout · Nobel

Solving very large weakly coupled Markov decision processes

1998

Rewarding behaviors

1996

Econometric Analysis of Stabilization Policies

1969 Standout · Nobel

Planning with deadlines in stochastic domains

1993

Reinforcement Learning with Factored States and Actions

2004 Standout · Nobel

Monte Carlo Matrix Inversion and Reinforcement Learning

1993

A Theory and Test of Credit Rationing

1969 Standout · Nobel

Neural Mechanisms of Hierarchical Planning in a Virtual Subway Network

2016 Standout · Nobel

Tracking the Emergence of Conceptual Knowledge during Human Decision Making

2009 Standout · Nobel

Mastering the game of Go without human knowledge

2017 Standout · Nature · Nobel

Human-level control through deep reinforcement learning

2015 Standout · Nature · Nobel

Continuous-Time Adaptive Critics

2007

Deep learning in neural networks: An overview

2014 Standout

Multiobjective dynamic programing with application to a reservoir

1979

Model-free Q-learning designs for linear discrete-time zero-sum games with application to H-infinity control

2007

Hierarchical Reinforcement Learning with the MAXQ Value Function Decomposition

2000

Alternative Approaches to Analyzing Markets with Asymmetric Information: Reply

1983 Standout · Nobel

A Neural Substrate of Prediction and Reward

1997 Standout · Science

Independence of irrelevant alternatives, and solutions to Nash's bargaining problem

1977 Standout · Nobel

Modified Policy Iteration Algorithms for Discounted Markov Decision Problems

1978

On Sequential Decisions and Markov Chains

1962 Standout

A Probabilistic Production and Inventory Problem

1963

Planning under time constraints in stochastic domains

1995

The Pricing of Options and Corporate Liabilities

1973 Standout · Nobel

An object-oriented representation for efficient reinforcement learning

2008

Reservoir Management and Operations Models: A State‐of‐the‐Art Review

1985 Standout

The Fair Wage-Effort Hypothesis and Unemployment

1990 Standout · Nobel

Multilevel incremental dynamic programing

1976

Toward a Theory of Discounted Repeated Games with Imperfect Monitoring

1990

Dividend Policy: An Empirical Analysis

1968 Standout · Nobel

IMPLEMENTATION OF AN OPTIMIZATION MODEL FOR OPERATION OF A METROPOLITAN RESERVOIR SYSTEM

1977

Classification and Regression Trees.

1984 Standout

Determinants of corporate borrowing

1977 Standout

Improved dynamic programing procedures and their practical application to water resource systems

1974

Optimizing decision trees through heuristically guided search

1978

On the Faustian Dynamics of Policy and Political Power

2011

Incentive Effects of Terminations: Applications to the Credit and Labor Markets

1983 Standout · Nobel

Optimal control of Markov processes with incomplete state information

1965

Dynamic programming applications in water resources

1982

Reforming the Global Economic Architecture: Lessons from Recent Crises

1999 Standout · Nobel

A dynamic programming successive approximations technique with convergence proofs

1970

A MEAN‐VARIANCE THEORY OF OPTIMAL CAPITAL STRUCTURE AND CORPORATE DEBT CAPACITY

1978

Technical Note: Q-Learning

1992 Standout

Human-level performance in 3D multiplayer games with population-based reinforcement learning

2019 Standout · Science · Nobel

Pareto Optimality and Competition

1981 Standout · Nobel

Saving and Liquidity Constraints

1991 Standout · Nobel

An income fluctuation problem

1976

The complexity of stochastic games

1992

On Nonterminating Stochastic Games

1966

Dynamic Stability and Reform of Political Institutions

2005

Numerical maximum log likelihood estimation for generalized lambda distributions

2006

Persistence of Power, Elites, and Institutions

2008 Standout · Nobel

The Folk Theorem with Imperfect Public Information

1994 Standout · Nobel

Learning Finite-State Controllers for Partially Observable Environments

2013

DEBT AND TAXES

1977 Standout · Nobel

Reputation Acquisition in Debt Markets

1989 Standout · Nobel

On the Theory of Infinitely Repeated Games with Discounting

1988

Distributed Lags: A Survey

1967

Cycles of Conflict: An Economic Model

2014 Standout · Nobel

Optimal cartel equilibria with imperfect monitoring

1986

A STOCHASTIC DYNAMIC PROGRAMMING MODEL FOR THE OPTIMUM OPERATION OF A MULTI‐PURPOSE RESERVOIR

1973

The convergence of TD(λ) for general λ

1992

The Optimal Control of Partially Observable Markov Processes over a Finite Horizon

1973

Adaptive optimal control for continuous-time linear systems based on policy iteration

2008 Standout

Reinforcement Learning: A Survey

1996 Standout

Online actor–critic algorithm to solve the continuous-time infinite horizon optimal control problem

2010 Standout

Game-theoretic models and the role of information in bargaining.

1979 Standout · Nobel

The Contributions of the Economics of Information to Twentieth Century Economics

2000 Standout · Nobel

Planning and acting in partially observable stochastic domains

1998 Standout

Corporate financing and investment decisions when firms have information that investors do not have

1984 Standout

Average reward reinforcement learning: Foundations, algorithms, and empirical results

1996

Using Expectation-Maximization for Reinforcement Learning

1997 Standout · Nobel

Convergence rate analysis of the state increment dynamic programming method

1983

Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning

1999 Standout

Galerkin approximations of the generalized Hamilton-Jacobi-Bellman equation

1997

Reliability‐constrained reservoir control problems: 1. Methodological issues

1979

A MODEL OF WARRANT PRICING IN A DYNAMIC MARKET

1970

Stochastic dynamic programming with factored representations

2000

Rules Rather than Discretion: The Inconsistency of Optimal Plans

1977 Standout · Nobel

Contraction Mappings in the Theory Underlying Dynamic Programming

1967

Externalities in Economies with Imperfect Information and Incomplete Markets

1986 Standout · Nobel

A Partitioning Algorithm with Application in Pattern Classification and the Optimization of Decision Trees

1973

Sharecropping and the Interlinking of Agrarian Markets

1982 Standout · Nobel

Asset Prices in an Exchange Economy

1978 Standout · Nobel

Debt Maturity Structure and Liquidity Risk

1991 Standout · Nobel

Evolution and Intelligent Design

2008 Standout · Nobel

Optimal long‐term control of a multipurpose reservoir with indirect users

1976

The Consequences of the Dependence of Quality on Price

1987 Standout · Nobel

Credit Rationing in Markets with Imperfect Information

1981 Standout · Nobel

Markets, Market Failures, and Development

1989 Standout · Nobel

Revenue Management Under a General Discrete Choice Model of Consumer Behavior

2004 Standout

Generalization of White's Method of Successive Approximations to Periodic Markovian Decision Processes

1972

Credit Rationing: Reply

1987 Standout · Nobel

Control Techniques for Complex Networks

2007

Solving H-horizon, stationary Markov decision problems in time proportional to log(H)

1990

Works of Marshall Freimer being referenced

Why Bankers Ration Credit

1965

Dynamic Programming and Markov Processes.

1961 Standout

Adaptive Control Processes: A Guided Tour.

1965

Some New Results on Compromise Solutions for Group Decision Problems

1976

Applied Dynamic Programming.

1964

A study of the generalized Tukey lambda family

1988