Matches in Nanopublications for { ?s ?p ?o <https://w3id.org/np/RAmEQd6Xc3YHZ6uH6_kOfPKx1SDVV-wn6aK8LrvcmNyZE#assertion>. }
Showing items 1 to 12 of
12
with 100 items per page.
- assertion type Observation assertion.
- assertion type Question assertion.
- assertion type Claim assertion.
- assertion comment " Scaling laws don't care about scale of the "train" models? Did anyone else get this? When I predict a scaling law, the scale of the largest model matters, but the num-models for fitting matters much much much more. Initial results, scaling error by #models starting from largest https://twitter.com/LChoshen/status/1803401845626511568/photo/1 Maybe more simply put: You can predict a scaling law with 8 small models, and it would be better than 3 large ones (that costs a lot) Is that something anyone else seen? " assertion.
- assertion creator RAoSadUw99CeqDlR2400018nqTzR_38fT86OrTzk16Vts assertion.
- assertion keywords "AI" assertion.
- assertion keywords "cost" assertion.
- assertion keywords "initialresults" assertion.
- assertion keywords "models" assertion.
- assertion keywords "modelscale" assertion.
- assertion keywords "scalinglaws" assertion.
- assertion keywords "training" assertion.