Kosmix
One sec... we're building your guide for
Reinforcement Learning
Reinforcement learning
Not looking for Reinforcement learning? See
Reinforcement (psychology)
Overview
Main ›
Tweets
Twitter.com
One sec... we're getting the
Tweets
More from Twitter.com »
Related in the Kosmos
?
Markov models
POMDP
Markov decision process
Machine learning
(241)
Advances in Neural Information Processing Systems
Artificial evolution
Artificial neural networks
Backpropagation
Bayesian networks
Bayesian networks
Computational learning theory
Ensemble learning
Evolutionary algorithms
Feature selection
Function approximation
Gaussian process
Grammatical inference
ICML
Inductive logic programming
Journal of Machine Learning Research
Kernel methods
Neural networks
Perceptron
Q learning
Robot learning
Rule induction
SARSA
Semi-supervised learning
Statistical learning
Supervised learning
Support vector machines
Temporal difference learning
Text classification
Transfer learning
Unsupervised learning
ALOPEX
Accuracy paradox
Activity recognition
Adjusted Mutual Information
Adjusted rand index
Algorithmic inference
Algorithmic learning theory
Alpha (machine learning)
Alternating decision tree
Analogical modeling
Apprenticeship learning
Artificial intelligence conferences
Auto-encoder
Automated submission web directories
Bag of words model
Base rate
Bipropagation
Bondy's theorem
Bongard problem
Bootstrap aggregating
Bootstrapping (machine learning)
Bregman divergence
CBCL (MIT)
CIML community portal
CN2 algorithm
Calibration (statistics)
Canopy clustering algorithm
Cascade correlation algorithm
Category utility
Class membership probabilities
Classification algorithms
Cluster analysis
Cluster-weighted modeling
Co-training
Compositional pattern-producing network
Concept class
Concept drift
Concept learning
Conceptual clustering
Conditional random field
Confusion matrix
Constructive induction
Cover's theorem
Cross-entropy method
Cross-validation (statistics)
Crossover (genetic algorithm)
Curse of dimensionality
DBSCAN
Data Pre-processing
Decision lists
Decision rules
Dendrogram
Deterministic policy
Dimension reduction
Discriminative model
Dominance-based Rough Set Approach
Dynamic time warping
EURASIP Journal on Advances in Signal Processing
Eager learning
Elastic Matching
Elbot
Ensembles of classifiers
Evolvability (computer science)
Expectation propagation
Expectatio- n-maximization algorithm
Explanation-based learning
Exponential mechanism (differential privacy)
FLAME clustering
FastICA
Feature extraction
Feature vector
First Order Inductive Learner
Formal concept analysis
Forward-backward algorithm
Gaussian process emulator
General Architecture for Text Engineering
Generalization error
Generative model
Generative topographic map
Genetic Algorithm for Rule Set Production
Gittins index
Glivenko–Cantelli theorem
GoldenGem
Granular computing
Group method of data handling
Growing self-organizing map
Helmholtz machine
Hidden Markov model
Hierarchical hidden Markov model
Hierarchical temporal memory
IAPR
ID3 algorithm
Inductive bias
Information Fuzzy Networks
Information bottleneck method
Instance-based learning
Iris flower data set
Jabberwacky
Java Machine Learning Library
K-means++
Kernel adaptive filter
Kernel methods for machine learning
Kernel principal component analysis
Kernel trick
Knowledge discovery
Knowledge integration
Language Acquisition Device (computer)
Layered hidden Markov model
Lazy learning
Leabra
Learning Automata
Learning Vector Quantization
Learning classifier system
Learning with errors
Linde-Buzo-Gray algorithm
List of machine learning algorithms
Locally weighted regression
Logistic model tree
LogitBoost
Machine Learning (journal)
Machine learning researchers
Mallet (software project)
Margin classifier
Markov decision process
Markov models
Matthews correlation coefficient
MeeMix
Message-passing method
Meta learning (computer science)
Minimum redundancy feature selection
Mixture model
Multi-armed bandit
Multi-task learning
Multiple-instance learning
Nearest neighbor search
Neural modeling fields
Non-negative matrix factorization
Nonlinear dimensionality reduction
Novelty detection
Numenta
OPTICS algorithm
Offline learning
One-class classification
Online learning model
Online machine learning
Overfitting
POMDP
PROGOL
Pachinko machine
Parity learning
Pattern recognition
Predictive learning
Predictive state representation
Premature convergence
Principal component analysis
Principle of maximum entropy
Prior knowledge for pattern recognition
Probability matching
Probably approximately correct learning
Quadratic classifier
Quadratic unconstrained binary optimization
Rademacher complexity
Radial basis function network
Rand index
Random forest
Random multinomial logit
RapidMiner
Regularization (mathematics)
Relevance vector machine
Rough set
Rprop
Sample exclusion dimension
Self-organizing map
Semantic analysis (machine learning)
Semantic mapping (statistics)
Sequential Minimal Optimization
Shattering
Shogun (toolbox)
Simultaneous localization and mapping
Smart variables
SmartMatch
Soft independent modelling of class analogies
Sparse PCA
Spiking neural network
Statistical classification
Stochastic gradient descent
Structural risk minimization
Structure mapping engine
Structured SVM
Teaching dimension
Test set
The iDistance Technique
Training set
Transduction (machine learning)
Ugly duckling theorem
Uncertain data
Uniform convergence (combinatorics)
Unique negative dimension
Universal Robotics
VC dimension
Variable-order Markov model
Venn-networks
Version space
Viterbi algorithm
Weighted Majority Algorithm
Weka (machine learning)
Win-Stay, Lose-Switch
Witness set
k-means clustering
k-medoids
k-nearest neighbor algorithm
more...
Cybernetics
Theoretical neuroscience
Biological cybernetics
Machine learning
Neural networks
Artificial evolution
Artificial intelligence
(13)
Action selection
Logic programming
Recurrent neural networks
Long short term memory
Cognitive robotics
Cognitive robotics
Multi-agent systems
Evolutionary computation
Computational intelligence
Machine learning
Q learning
ICML
SARSA
Transfer learning
more...
Learning
Meta learning
Connectionist
Sequence learning
Machine learning
Algorithms
Genetic algorithms
Machine learning
Evolutionary algorithms
Statistical models
Graphical models
Independent component analysis
Function approximation
Machine learning researchers
(28)
Jürgen Schmidhuber
Marcus Hutter
Michael L. Littman
Michael i jordan
Sebastian Thrun
Sebastian Thrun
Zoubin Ghahramani
Alberto Broggi
Ayanna M. Howard
Bernhard Schölkopf
Brian D. Ripley
Ernst Dickmanns
Geoffrey Hinton
Hartmut Neven
Jacek M. Zurada
Jaime Carbonell
Karl Steinbuch
Katia Sycara
Leo Breiman
Luca Maria Gambardella
Léon Bottou
Michael Collins (computational linguist)
Peter Flach
Pierre Baldi
Ross Quinlan
Stephen Muggleton
Steve Omohundro
Vladimir Vapnik
Yann LeCun
more...
Optimization algorithms
Dynamic programming
Gradient descent
Ant colony optimization
Artificial neural networks
Evolutionary algorithms
Artificial evolution
Artificial intelligence conferences
(8)
AAAI
European Conference on Machine Learning
IJCAI
AI@50
Advances in Neural Information Processing Systems
Advances in Neural Information Processing Systems
European Conference on Artificial Intelligence
ICML
List of artificial intelligence conferences
more...
Computational neuroscience
(48)
Hebbian learning
Neural computation
Action potential
Advances in Neural Information Processing Systems
Artificial Intelligence System
Artificial Intelligence System
Artificial neural networks
BCM theory
Bayesian brain
Biological neuron model
Blue Brain Project
Brain-reading
Brian (software)
CARET (Computerized Anatomical Reconstruction and Editing Toolkit)
Cable theory
Cerebellar Model Articulation Controller
Connectionist
Connectome
Cultured neuronal network
Dendritic spine
Diffusion Networks
Fast Analog Computing with Emergent Transient States (FACETS)
FitzHugh–Nagumo model
Hindmarsh-Rose model
Hodgkin–Huxley model
International Neuroinformatics Coordinating Facility
Laurent Itti
Linear-non- linear-Poisson cascade model
Modular neural networks
Neural Field Theory
Neural backpropagation
Neural coding
Neural networks
Neuro cybernetics
Neurodynamics
Neuroinformatics
Neuroinformatics (journal)
Neuron (software)
Neuronstudio
Neurotechnology
Parallel Constraint Satisfaction Processes
Pulse computation
Softmax activation function
Soliton model
Spike-triggered average
Spike-triggered covariance
Temporal difference learning
Theoretical neuromorphology
Wilson-Cowan model
more...
Neural networks
(92)
Biological neural networks
Radial basis functions
ADALINE
ALOPEX
Activation function
Activation function
Adaptive resonance theory
Advances in Neural Information Processing Systems
Artificial Intelligence System
Artificial neural networks
Artificial neuron
Auto-encoder
Autoassociative memory
Backpropagation
Bidirectional Associative Memory
Bipropagation
Boltzmann machine
Canopy clustering algorithm
Cascade correlation algorithm
Cellular neural network
Cerebellar Model Articulation Controller
Committee machine
Compositional pattern-producing network
Computational cybernetics
Computational neurogenetic modeling
Confabulation (neural networks)
Cortical column
Cultured neuronal network
Cybenko theorem
Delta rule
Early stopping
European Neural Network Society
Evolutionary Acquisition of Neural Topologies
Feed-forward
Feedforward neural network
Fuzzy cellular neural networks
Generalized Hebbian Algorithm
Group method of data handling
Growing neural gas
Growing self-organizing map
Helmholtz machine
Hopfield net
Hybrid neural network
HyperNEAT
Instantaneously trained neural networks
Interactive Activation and Competition
K-means++
Learning Vector Quantization
Lernmatrix
Linde-Buzo-Gray algorithm
Liquid state machine
Long short term memory
Memory-prediction framework
Modular neural networks
MoneyBee
Multilayer perceptron
NETtalk (artificial neural network)
Neocognitron
Neural Field Theory
Neural Networks (journal)
Neural backpropagation
Neural computation
Neural cryptography
Neural gas
Neural network software
Neurally controlled animat
Neuroevolution of augmenting topologies
Neuroplasticity
Oja's rule
Optical neural network
Perceptron
Pulse-coupled networks
Quantum neural network
Radial basis function network
Random neural network
Recurrent neural networks
Reservoir computing
Rprop
SNARC
Self-organizing map
Semantic neural network
Sigmoid function
Softmax activation function
Spiking neural network
Stochastic neural network
Support vector machines
Synaptic weight
Tensor product network
The Emotion Machine
Time delay neural network
Venn-networks
Winner-take-all
k-means clustering
more...
Artificial intelligence researchers
Sven Koenig (computer scientist)
Daphne Koller
Stuart J. Russell
Jürgen Schmidhuber
Zoubin Ghahramani
Michael i jordan
Sebastian Thrun
See also
(20)
Value iteration
Computer science
Policy iteration
John shawe taylor
IDSIA
IDSIA
Value function
Richard Sutton
Algorithmic information
Algorithmic probability
Peter dayan
Kolmogorov complexity
Emanuele Tesauro
Multiagent
Probabilistic
Solomonoff induction
Yakov Peters
Phenomenon
Richard S. Sutton
John Langford (computer scientist)
Journal of Artificial Intelligence Research
more...
more categories...