Delayed reinforcement learning for adaptive image segmentation and feature extraction

Jing Peng, Bir Bhanu

Research output: Contribution to journalArticlepeer-review

26 Scopus citations


Object recognition is a multilevel process requiring a sequence of algorithms at low, intermediate, and high levels. Generally, such systems are open loop with no feedback between levels and assuring their robustness is a key challenge in computer vision and pattern recognition research. A robust closed-loop system based on "delayed" reinforcement learning is introduced in this paper. The parameters of a multilevel system employed for model-based object recognition are learned. The method improves recognition results over time by using the output at the highest level as feedback for the learning system. It has been experimentally validated by learning the parameters of image segmentation and feature extraction and thereby recognizing 2-D objects. The approach systematically controls feedback in a multilevel vision system and shows promise in approaching a long-standing problem in the fleld of computer vision and pattern recognition.

Original languageEnglish
Pages (from-to)482-488
Number of pages7
JournalIEEE Transactions on Systems, Man and Cybernetics Part C: Applications and Reviews
Issue number3
StatePublished - 1998


  • Adaptive feature extraction
  • Adaptive image segmentation
  • Learning for multilevel vision
  • Learning in computer vision
  • Modelbased recognition
  • Multiscenario object recognition
  • Recognition feedback


Dive into the research topics of 'Delayed reinforcement learning for adaptive image segmentation and feature extraction'. Together they form a unique fingerprint.

Cite this