sub:assertion {
sub:assertion dcterms:creator <
https://w3id.org/np/RAoSadUw99CeqDlR2400018nqTzR_38fT86OrTzk16Vts> ;
rdfs:comment """ Human feedback is critical for aligning LLMs, so why don’t we collect it in the open ecosystem?๐ง
We (15 orgs) gathered the key issues and next steps.
Envisioning
a community-driven feedback platform, like Wikipedia
https://www.alphaxiv.org/abs/2408.16961
๐งต https://twitter.com/LChoshen/status/1831708316982231235/photo/1
We define 5 axes of openness:
Methodology (how its collected)
Access (who can use it)
Models (one\\many)
Contributors (as diverse as its uses?)
Time (keeps updating? closed models improve over several feedback iterations, and of course, models change)
Is current feedback open?๐ฅถ https://twitter.com/LChoshen/status/1831708319855362549/photo/1
In our paper, we first learn from peer production efforts like wiki and stack overflow. These case studies tell us how important it is to align incentives of different bodies, allow the community to dictate the policies, etc.
Then, we hone on 6 crucial areas to develop open human feedback ecosystems:
Incentives to contribute, reducing contribution efforts, getting expert and diverse feedback, ongoing dynamic feedback, privacy and legal issues. https://twitter.com/LChoshen/status/1831708324284522965/photo/1
We believe a successful ecosystem must center around feedback loops
where anyone can spin up a community model, for storytelling, Bengali or anything else
Others can use it, give feedback, and benefit from a model that keeps improving with the contributions https://twitter.com/LChoshen/status/1831708326717161683/photo/1
The feedback from all models will be open and collected in one pool,
helping beyond the specialized models created to future research and general improvement
This was a huge effort and the paper is packed with ideas thanks to:
#deepRead
@Shachar_Don @ben_burtenshaw @RamonAstudill12 @cailean_osborne @MimansaJ @tzushengkuo @wzhao_nlp @IdanShenfeld @TheAndiPenguin @Yurochkin_M @Dr_Atoosa @YangsiboHuang @tatsu_hashimoto @YJernite @dvilasuero @AbendOmri @jen_gineered @sarahookr @hannahrosekirk
Note, we don't only preach open, this was open with contributions from so many organizations
@CohereForAI @MITIBMLab @IBMResearch @huggingface @nlphuji @UniofOxford @MIT_CSAIL @StanfordHAI @turinginst @princeton_nlp
@cmuhcii @EdinburghUni @cornell_tech
Please ask us anything, share, discuss and talk to us, we are going to make it real! Together!
Much much more in the paper:
https://www.alphaxiv.org/abs/2408.16961
""" ;
schema:keywords "LLMs" , "communitydriven" , "deepRead" , "feedbackplatform" , "humanfeedback" , "interdisciplinary" , "openecosystem" ;
<
https://sense-nets.xyz/announcesResource> <
https://www.alphaxiv.org/abs/2408.16961> ;
<
https://sense-nets.xyz/recommends> <
https://www.alphaxiv.org/abs/2408.16961> ;
<
https://sense-nets.xyz/summarizes> <
https://www.alphaxiv.org/abs/2408.16961> .
<
https://www.alphaxiv.org/abs/2408.16961> <
https://sense-nets.xyz/hasZoteroItemType> "webpage" .
}