#TrainingData hashtag - Bluesky

@digitalblacksmiths.bsky.social

7 hours ago

Be IN AI's training data = they literally learned from YOU 🧠

#AISEO #AITrainingData #MachineLearning #SourceAuthority #ChatGPT #TrainingData #DigitalMarketing #AISource

2 0 0 0

A Chronic Voice

@achronicvoice.com

3 days ago

Here's Why You Should Never Use AI to Generate Your Passwords Researchers found that AI chatbots like ChatGPT, Claude, and Gemini are not good at producing secure passwords.

"trad #PasswordManagers are not #LLMs ..they r designed to produce a truly #random sequence, by taking #cryptographic bits & converting them into characters. These outputs are not based on existing #TrainingData & follow no patterns.": buff.ly/53oLoCZ

via @lifehacker.com

5 1 0 0

ProgrammerHumor.io

@programmerhumor-io.bsky.social

3 days ago

Propaganda Knows No Bounds

Propaganda Knows No Bounds

#Ai #Captcha #machinelearning #Trainingdata #Synthetic-data

programmerhumor.io/ai-memes/propaganda-know...

1 0 1 0

infoDOCKET

@infodocket.bsky.social

1 week ago

AI-Copyright Suits Target Pirate Sites Supplying Training #Data (via @bloomberglaw.com) news.bloomberglaw.com/ip-law/ai-co... #LLMs #GenAI #trainingdata #copyright

0 0 0 0

Nia Soler

@niasoler.mastodon.social.ap.brid.gy

1 week ago

AI companies want to harvest improv actors’ skills to train AI on human emotion Source: https://www.theverge.com/ai-artificial-intelligence/893931/ai-companies-handshake-improv-actors-training-data

Si la IA generativa es solo una «herramienta», ¿para qué necesitan las tecnológicas a profesionales que la entrenen con emociones humanas? Al registro masivo de datos biométricos y a la emulación de emociones le falta un buen acopio de muestras de ADN para […]

[Original post on mastodon.social]

1 2 0 0

mikronews

@calculito.bsky.social

2 weeks ago

(Who is) The Boss Thesis: AI’s visual defaults reveal not a masculine or feminine bias, but something stranger — a world built by male attention, then shaped…

AI was trained on the output of human civilization — and by authorship, that civilization was mostly male. The outputs center women. That contradiction is not a bug someone introduced.
medium.com/p/62558fbb15fe
#ArtificialIntelligence #TrainingData #DesignDefaults #RepresentationBias #AIEthics

0 0 0 0

mikronews

@calculito.bsky.social

2 weeks ago

(Who is) The Boss Thesis: AI’s visual defaults reveal not a masculine or feminine bias, but something stranger — a world built by male attention, then shaped…

I just published (Who is) The Boss medium.com/p/who-is-the...
#ArtificialIntelligence #TrainingData #DesignDefaults #RepresentationBias #AIEthics

1 0 0 0

mikronews

@calculito.bsky.social

2 weeks ago

(Who is) The Boss Thesis: AI’s visual defaults reveal not a masculine or feminine bias, but something stranger — a world built by male attention, then shaped by male desire, producing outputs that center women as subje...

Check out my latest article: (Who is) The Boss www.linkedin.com/pulse/who-th... via www.linkedin.com/in/ionoaie/

#ArtificialIntelligence #TrainingData #DesignDefaults #RepresentationBias #AIEthics

1 1 0 0

the5krunner

@the5krunner.bsky.social

2 weeks ago

Performance Condition | the5krunner Performance Condition Performance Condition is a real-time metric that shows how the body is performing relative to its established baseline during a run

Your Garmin shows a number mid-run that most athletes ignore — or misread. Performance Condition explained in full: what it measures, what distorts it, and when to act on it.
the5krunner.com/garmin-featu...
#Running #Garmin #TrainingData

2 0 0 0

Katherine Borges

@dnakath.bsky.social

1 month ago

Free or not to free Have you ever heard the saying, "If something is free, you're the product"? You innately know that's true. That's why Facebook has ads and l...

My latest AI blog post - this one is on training data and bots. Pretty funny captcha by ChatGPT! #AI #DNA #ArtificialIntelligence #CRAIGEN #privacy #trainingdata #genealogy gptfamilytree.blogspot.com/2026/02/free...

2 0 1 0

the5krunner

@the5krunner.bsky.social

1 month ago

HRV Data: What 5 Years of Daily Tracking Showed Me Five years of daily heart rate variability measurements reveal patterns about sustainable training, measurement protocols, and what metrics actually matter. Analysis includes position changes, strengt...

5 years of daily HRV tracking showed stability until I changed measurement position in Feb 2025. Metrics shifted immediately—but kept drifting for months. Position artifact or genuine adaptation? the5krunner.com/2026/02/15/h... #running #HRV4Training #heartratevariability #trainingdata

0 0 0 0

iMerit

@imerit.bsky.social

1 month ago

We’re heading to the India AI Impact Summit 2026.
Meet the iMerit team at Booth 1.45, Bharat Mandapam, New Delhi, to discuss data quality, model evaluation, and human-in-the-loop AI for real-world deployment.

#AIImpactSummit #ModelEvaluation #TrainingData

0 0 0 0

gtbarry

@gtbarry.bsky.social

2 months ago

Even Starlink Wants Your Data for AI Model Training. How to Opt Out SpaceX uses your data to train its machine learning and AI models and might share that with partners who 'help us develop AI-enabled tools that improve your customer experience.'

SpaceX has updated its Starlink privacy policy to say customer data can be used to train AI models, and subscribers are opted in by default

Also, the company might share personal information with third parties

#privacy #artificialintelligence #AI #trainingdata #spacex #starlnk #bigdata #data #tech

2 0 0 0

Bob Carver

@cybersecboardrm.bsky.social

2 months ago

Cybersecurity Experts Warn That Most Gmail Users Don’t Realize This AI Setting Is Already Turned On
Inbox www.inc.com/leila-sh... #cybersecurity #privacy #Gmail #AI #trainingdata

0 0 0 0

Association of Research Libraries

@arl.org

2 months ago

FREE UKSG webinar: The Open Access–AI Conundrum: does free to read mean free to train? - UKSG This is a fantastic opportunity to listen to expert speakers with no travelling required. This is a free webinar - Please note that advance registration is required. This webinar will be recorded and ...

Upcoming UKSG Webinar (Thu Feb 5): The Open Access – #AI Conundrum: Does Free to Read Mean Free to Train? www.uksg.org/events/free-... #LLMs #oa #trainingdata #scholcomm #publishing
@uksg.bsky.social

4 1 0 0

infoDOCKET

@infodocket.bsky.social

2 months ago

Upcoming UKSG Webinar (Feb. 5, 2026): The Open Access – #AI Conundrum: Does Free to Read Mean Free to Train? www.uksg.org/events/free-... #LLMs #oa #trainingdata #scholcomm #publishing @uksg.bsky.social

3 2 0 0

Gerrit Eicker

@eicker.bsky.social

2 months ago

#LLMs like #OpenAI’s #GPT and #Google’s #Gemini #store portions of their #trainingdata, contradicting claims that they only learn #patterns. This “#memorisation” poses #legalrisks for AI companies, potentially leading to #copyrightinfringement lawsuits. The phenomenon also challenges the industry’s…

0 0 0 0

the5krunner

@the5krunner.bsky.social

2 months ago

Strava Files for IPO: Everything you need to know. Strava has reportedly filed for a 2026 IPO with suggestions of a target valuation of $3B. From the inevitable rollout of feed ads to new AI-coaching features, we analyze how going public will fundamen...

Strava files for IPO at $3B valuation! What it means for your training data, ads, privacy & more. Deep dive: the5krunner.com/2026/01/09/s... #Strava #IPO #FitnessTech #Running #Cycling #TrainingData #SportsTech #Athletes

0 1 0 0

Gerrit Eicker

@eicker.bsky.social

2 months ago

#Anthropic is challenging the prevailing belief in Silicon Valley that scaling up #compute and #infrastructure is the only path to success. Instead, Anthropic is focussing on #algorithmicefficiency, #smarterdeployment, and higher quality #trainingdata to achieve powerful models with less resources.…

0 0 1 0

Fyx

@fyx.jp

3 months ago

The Training Data Apocalypse What happens when official sources become disinformation? Future AI won't be able to tell you why something is wrong.

fyx.jp/l/en-US/diar... #AIFuture #Misinformation #TrainingData

0 0 0 0

James German, Hero-Maker

@jamesgerman.bsky.social

3 months ago

#TrainingData

(free on my substack)

open.substack.com/pub/lordstre...

#AI
#MDGP

0 0 0 0

Nicky 🦋🐛🌱🌿🐈‍⬛🐈 (she/her)

@nhallerwilson.bsky.social

3 months ago

OUR #paintings, OUR #words, OUR #music, OUR #faces, and OUR #voices - ARE NOT THEIR TRADE SECRETS !

12/8/2025, Stanford U., 10 am, hearing for #AB-412 #Generative #artificialintelligence: #trainingdata: #copyrighted materials

leginfo.legislature.ca.gov/faces/billNa...

2 0 1 0

Ars Technica News

@arstechni.ca

3 months ago

OpenAI desperate to avoid explaining why it deleted pirated book datasets https://arstechni.ca #copyrightinfringement #piratingbooks #onlinepiracy #trainingdata #ChatGPT #Policy #openai #AI

0 0 0 0

Tobias Zeumer

@vform.openbiblio.social.ap.brid.gy

4 months ago

Original post on openbiblio.social

Libraries open their archives to train AI chatbots with books spanning centuries of human knowledge | Milwaukee Independent

www.milwaukeeindependent.com/newswire/libraries-open-...

"Now that such text is of use as […]

0 3 0 0

Chris Lombardi

@chrisblue.bsky.social

4 months ago

Meta Says Porn Stash was for 'Personal Use,' Not Training AI Models Will they unmask the gooners?

br00t4c@mastodon.social - Meta Says Porn Stash was for 'Personal Use,' Not Training AI Models

#TrainingData #ContentModeration #TechIndustry #PrivacyConcerns #UnmaskGooners

gizmodo.com/meta-says-po...

0 0 0 0

PyrexCorex

@pyrexcorex.bsky.social

5 months ago

#trainingdata

0 0 0 0

Muhammad Ali

@muhammadalighugh.bsky.social

5 months ago

"Training data is the silent hero behind any AI — its quality, diversity, and balance define your model’s limits. In 2025, dataset sizes double every eight months, pushing us toward synthetic data techniques — but beware “model collapse” risks arise when models train on their own outputs. How confident are you in your training data’s foundation? #AI #TrainingData #MachineLearning #DataQuality

Training data is the silent hero behind any AI its quality,diversity,and balance define your model’s limits.In 2025,dataset sizes double every eight months,pushing us toward synthetic data techniques but beware “model collapse” risks arise when models train on their own outputs.
#AI #TrainingData

2 1 0 0

Ars Technica News

@arstechni.ca

5 months ago

AI models can acquire backdoors from surprisingly few malicious documents https://arstechni.ca #UKAISecurityInstitute #alanturinginstitute #AIvulnerabilities #backdoorattacks #machinelearning #datapoisoning #trainingdata #LLMsecurity #modelsafety #pretraining #AIresearch #AIsecurity…