|

Reinforcement learning

turkey pigeon, collared pigeon, bird, dove, animal, nature, fauna, meeting, dove, dove, dove, dove, dove

Reinforcement learning is a method that drives learning and memory in primitive species such as birds, humans and other living species to its use in machine learning. It is used to influence the behavior of us humans on social media to its use to train machines.

The essential components were initiated by BF Skinner 20th century’s most eminent psychologist. The key concept he developed was that the behavior that was rewarded would be repeated and followed. His key idea was on how to reinforce behavior by intermittent schedules of reinforcement. His belief was the complex behaviours sequences could be taught by reinforcing rewards. His key hypothesis is that human behavior is controlled by environment and the future of humanity could be saved by systematic control of behavior to specific desirable ends rather than haphazardly.

He developed his method through training of pigeons going so far as to make them so trained that they could be used to guide a missile for the US Military (Pigeon’s in a Pelican: https://www.appstate.edu/~steelekm/classes/psy3214/Documents/Skinner1960.pdf)

“To say that a reinforcement is contingent upon a response may mean nothing more than that it follows the response. It may follow because of some mechanical connection or because of the mediation of another organism; but conditioning takes place presumably because of the temporal relation only, expressed in terms of the order and proximity of response and reinforcement. Whenever we present a state of affairs which is known to be reinforcing at a given drive, we must suppose that conditioning takes place, even though we have paid no attention to the behavior of the organism in making the presentation.”

– B.F. Skinner, “Superstition’ in the Pigeon” (p. 168)

He developed the Operant conditioning chamber, later called the Skinner box, in which the animals were taught certain behavior’s by rewarding or punishing the animal’s actions.

By the time Skinner retired from Harvard, behaviorism declined, and his theories were criticized but this this method of reinforcement learning is used by machine learning methods nowadays to train a machine to recognize specific patterns while ignoring other patterns.

In addition –

The little “like” button or “number of followers” are today’s reward system that is used to reward behavior of a social media poster to post more…that grants more posts and more rewards.

Similar Posts

  • |

    Engaging website

    There are many ways that websites show their landing pages. Usually there are a lot of graphics. The Harvard innovation lab does it wonderfully with their landing page for one of their challenges showing particles which are responsive to mouse movement. Very dynamic and very intriguing! https://innovationlabs.harvard.edu/presidents-innovation-challenge/finalists The developer’s console tells you a bit about…

  • Observability

    Observability is important for AI and AI tools. It is the ability to monitor them for token usage, response quality and model drift. Typically, an AI system is monitored through logs, traces and metrics but an AI system on AI agent may need other metrics. Troubleshooting a complex AI system that produces its output probabilistically…

  • Plant communication

    Plants communicate with each other. That much is known. However, what is not known is how plants communicate using symbiosis. Dr Johnson has reported new work in Ecology letters that shows that plants use symbiotic fungi to enable communication. Communication also causes interesting secretion by plants when infected by aphids. These secretions that are volatile…

  • Vitamin K2

    It has been traditionally thought that for bone health, sufficient calcium is required. However, it is just not calcium. A vitamin called Vitamin K1 (phylloquinone) and K2 (menaquinone) in combination with Vitamin D2 and calcium is important for activating a protein called osteocalcin which binds to calcium to build bones. Osteocalcin is also involved in…

  • Squid camouflage

    Distributed light sensing Squid can really hide well in their environment. It is important for their survival since they have minimal active defenses but how is it important to us? Office of Naval Research thought that it was important enough to grant a $ 6 million grant to Roger Hanlon of Woods Hole Marine Biological…