#PDFMining

Back at it. System gave us 500 gems… and 10× more junk 😂. Quick tweaks and we're nearly done with stage one: mining pretrain data from rare, cross-domain PDFs.

#AIpretrain #SpanAware #TokenizerFree #PDFMining #XSpanformer #DataCuration #OpenScience
#artificialintelligence
