EWOK datasets are built to have this trait: https://x.com/neuranna/status/1791465842632454184 Interesting trend. Will it last? Will it solve contamination? https://twitter.com/LChoshen/status/1806396147281637645/photo/1
@XuanmingZhang07 @Zhou_Yu_AI @columbianlp If you ask me, it's a nice step, but it only solves the worst kind of contamination (outright training on the test set). It doesn't address training on similar formats, synthetic data, etc. to improve scores. So it's a good approach that should last, but we need more. (@deliprao you made a similar claim, right?)