I mean, is this any different from standard gradient descent with something like Adam as the optimiser?
That’s my assumption based on the headline. But from the quick skim I gave the article, it seemed to discuss it only in the context of NLP. Not exactly my field of study.