Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
sanxiyn
on Nov 23, 2022
|
parent
|
context
|
favorite
| on:
CICERO: An AI agent that negotiates, persuades, an...
Eh, it does learn from self play via RL. One section of the paper is literally titled "Self-play reinforcement learning for improved value estimation". Yes, that's only a small part of the entire system.
andreyk
on Nov 24, 2022
[–]
ah, good catch; I was going off of the blog post
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: