This week's book giveaway is in the Artificial Intelligence and Machine Learning forum.
We're giving away four copies of Zero to AI - A non-technical, hype-free guide to prospering in the AI era and have Nicolò Valigi and Gianluca Mauro on-line!
See this thread for details.
Win a copy of Zero to AI - A non-technical, hype-free guide to prospering in the AI era this week in the Artificial Intelligence and Machine Learning forum!
  • Post Reply Bookmark Topic Watch Topic
  • New Topic
programming forums Java Mobile Certification Databases Caching Books Engineering Micro Controllers OS Languages Paradigms IDEs Build Tools Frameworks Application Servers Open Source This Site Careers Other all forums
this forum made possible by our volunteer staff, including ...
Marshals:
  • Campbell Ritchie
  • Liutauras Vilda
  • Paul Clapham
  • Bear Bibeault
  • Jeanne Boyarsky
Sheriffs:
  • Ron McLeod
  • Tim Cooke
  • Devaka Cooray
Saloon Keepers:
  • Tim Moores
  • Tim Holloway
  • Jj Roberts
  • Stephan van Hulst
  • Carey Brown
Bartenders:
  • salvin francis
  • Scott Selikoff
  • fred rosenberger

Reinforcement Learning In Action - Multi Label Classification

 
Greenhorn
Posts: 21
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
Hi Alexander and Brandon.

Can Reinforcement Learning be used for Multi-Label classification?
We have a potential application where the labels run to tens or perhaps a few hundred thousand different labels. Is Reinforcement Learning something we should be investigating for this?

Thanks
Don.
 
Author
Posts: 7
5
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator

Don Horrell wrote:Hi Alexander and Brandon.

Can Reinforcement Learning be used for Multi-Label classification?
We have a potential application where the labels run to tens or perhaps a few hundred thousand different labels. Is Reinforcement Learning something we should be investigating for this?

Thanks
Don.



If it's a static sort of classification problem like image classification, then RL wouldn't be the right way to conceptualize the problem. However, if your classification problem exists in a dynamic environment, such as deciding which advertisement to display for a particular user, which may change depending on dynamic user and sitewide data, then RL could be useful (see multi-armed bandits). RL is a framework for decision-making in a dynamic environment.
 
Don Horrell
Greenhorn
Posts: 21
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator
Thank you for your interesting comment.

Could you explain a bit more about what you mean by a "dynamic environment" please?

So, if Reinforcement Learning can select an advertisement for a user, that sounds similar to classification (of images?). A person walks past a smart advertising board which somehow identifies that person and the "label" is the type of advert that will be displayed for that person.
Or have I got the wrong end of the stick? Or perhaps even the wrong stick?


Cheers
Don.
 
Brandon Brown
Author
Posts: 7
5
  • Mark post as helpful
  • send pies
  • Quote
  • Report post to moderator

Don Horrell wrote:Thank you for your interesting comment.

Could you explain a bit more about what you mean by a "dynamic environment" please?

So, if Reinforcement Learning can select an advertisement for a user, that sounds similar to classification (of images?). A person walks past a smart advertising board which somehow identifies that person and the "label" is the type of advert that will be displayed for that person.
Or have I got the wrong end of the stick? Or perhaps even the wrong stick?


Cheers
Don.



Yes the label would be one of some finite number of possible advertisements. The objective here, however, would be for the RL algorithm to optimize the clickthrough rate. So rather than an image classification where the algorithm is trained based on whether or not the classification is correct or not, the decision here is not binary. There is no one correct ad, some ads will result in more clicks (if these are ads on a website) than others, and the goal is to learn which ad will cause a potential customer to be most likely to respond to the ad by buying.

A dynamic environment just to mean that again, the decision isn't a correct/incorrect labeling, but a set of actions that lead to more or less of some outcome (points in a game, clicks for ads, money if trading stocks, etc)
 
knowledge is the difference between drudgery and strategic action -- tiny ad
the value of filler advertising in 2020
https://coderanch.com/t/730886/filler-advertising
reply
    Bookmark Topic Watch Topic
  • New Topic