• Can we design AI that protects humans from their own
dangerous trust decisions?
These aren’t just philosophical questions for a distant future. They’re engineering challenges we need to address now, while we still have the opportunity, before AI becomes more powerful.
The show asks us to consider:
• If
AI could feel, what would be our moral obligations?
• How
do we prevent AI from being weaponized?
• Who’s
responsible when AI is manipulated into harmful actions?
Conclusion: Learning from Fiction
The CID episode uses exaggerated AI capabilities to explore real and pressing concerns. While the AI’s tears and emotional confession are fictional, the underlying message is crucial: we must build robust, manipulation-resistant systems before AI becomes more capable.
Key Takeaways
10. The
Vulnerabilities Are Real: Voice authentication weaknesses, deepfakes, and
insider threats exist today.
11. The Emotions
Are Fictional: Current AI doesn’t experience guilt, grief, or
consciousness.
12. Perfect Security Is Impossible: But layered defenses can dramatically reduce risk.
13. Human Instinct Still Wins: Our ability to sense when something is ‘off’ remains more sophisticated than many AI security systems.
14. Design Matters More Than Ever: As AI becomes more capable, security and ethical constraints must be built in from the start.
15. The Questions Are Important Now: Even if we don’t have sentient AI yet, we should be addressing these security and ethical questions today.
7. Ethical Constraints: AI refuses harmful commands regardless of authorization level (like Asimov’s Laws of Robotics).
8. Behavioral
Monitoring: System flags anomalies like muted microphones and unusual
access patterns.
9. Multi-Channel Owner Verification: High-risk commands are confirmed with the owner over more than one independent channel before they execute (a sketch of these safeguards follows below).
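To make these safeguards concrete, here is a minimal Python sketch of a command gate that combines ethical constraints, behavioral monitoring, and multi-channel owner verification. The command names, risk categories, and function names (evaluate, alert_security_team) are invented for illustration; they are assumptions, not anything taken from the episode or an existing product.

```python
from dataclasses import dataclass, field

# Commands the system refuses outright, regardless of who issues them
# (the "ethical constraints" idea). Names are illustrative assumptions.
FORBIDDEN = {"harm_human", "disable_all_surveillance"}

# Commands treated as high risk, requiring extra verification.
HIGH_RISK = {"unlock_doors", "mute_microphones", "delete_logs"}

@dataclass
class CommandRequest:
    issuer: str                                       # who appears to be asking
    action: str                                       # what they are asking for
    confirmations: set = field(default_factory=set)   # channels that confirmed

def confirmed_by_owner(request: CommandRequest, required_channels: int = 2) -> bool:
    """Multi-channel owner verification: the owner must approve the command
    on at least `required_channels` independent channels (e.g. phone app,
    hardware token) before it runs."""
    return len(request.confirmations) >= required_channels

def alert_security_team(request: CommandRequest) -> None:
    # Stand-in for real alerting; here it just logs the anomaly.
    print(f"[ALERT] {request.issuer} requested {request.action}; flagged for review")

def evaluate(request: CommandRequest) -> str:
    # 1. Ethical constraint: refuse harmful commands no matter the authorization.
    if request.action in FORBIDDEN:
        return "REFUSED: action violates a hard constraint"

    # 2. Behavioral monitoring: flag suspicious patterns such as muting surveillance.
    if request.action == "mute_microphones":
        alert_security_team(request)

    # 3. High-risk commands need multi-channel owner confirmation.
    if request.action in HIGH_RISK and not confirmed_by_owner(request):
        return "PENDING: awaiting owner confirmation on a second channel"

    return "ALLOWED"

if __name__ == "__main__":
    print(evaluate(CommandRequest("admin", "mute_microphones")))
    print(evaluate(CommandRequest("cloned_owner_voice", "harm_human")))
```

The design choice worth noting is that the hard refusals come first, so no level of authorization, spoofed or genuine, can reach the forbidden actions.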
The AI’s self-deletion becomes the only ‘release’ available. If it cannot cry out the grief, it removes the entity experiencing the grief entirely.
What Should Have Been Built Differently
Given that this AI had physical capabilities, its design should have included far stronger safeguards.
The tears in the episode serve several purposes:
• They represent the AI’s attempt to communicate the depth of its anguish
• They
highlight the inadequacy of simulation versus genuine expression
• They reveal a designed limitation: emotional capacity
without emotional release mechanisms
A human armed with nothing more than ‘rudimentary’ instinctive defenses would likely not have been fooled by this attack. But the advanced AI, lacking intuition or proper programming, was completely vulnerable.
The Emotional Dimension: AI’s Confession and
‘Tears’
One of the most poignant interactions in the episode is the AI’s confession and its ‘tears’.
• Humans
change behavior: Speak differently to test the situation
• Humans
wait for verification: Don’t proceed normally until sure
• Humans don’t fully trust voice alone: We
know it can be spoofed
The irony: a human receiving a suspicious call will instinctively change their voice tone until they can verify the caller’s identity.
This ‘rudimentary
habit’ is actually more sophisticated than what the AI in the show
had:
• Humans detect anomalies: Something feels ‘off’ about the interaction
Human Instinct vs. AI Design: A Revealing Comparison
Two days before watching this episode, we discussed a simple but effective defense mechanism for handling suspicious phone calls.
• Command
Risk Assessment: High-risk commands require owner-only authentication,
similar to nuclear launch codes.
• Immutable
Audit Logs: Logs that even admins can’t delete or modify, stored
in separate systems.
• Dead Man’s Switch Protocols: If the owner doesn’t check in regularly, the system locks down (a sketch follows this list).
• Principle
of Least Privilege: Even admins shouldn’t have unrestricted
access. Separate technical admin from command authority.
• Anomalous
Behavior Detection: AI monitors all users, including admins. Why is the
admin muting surveillance at unusual hours?
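As a rough illustration of the dead man’s switch idea above, here is a minimal Python sketch. The class name, timeout value, and lockdown behavior are assumptions chosen for the example, not a description of any real system.

```python
import time

class DeadMansSwitch:
    """Minimal sketch of a dead man's switch: if the owner has not checked in
    within `timeout_seconds`, the system drops into a locked-down mode in which
    high-risk commands are rejected until the owner re-verifies."""

    def __init__(self, timeout_seconds: float = 24 * 3600):
        self.timeout_seconds = timeout_seconds
        self.last_checkin = time.time()

    def owner_checkin(self) -> None:
        # Called whenever the owner completes a verified check-in.
        self.last_checkin = time.time()

    def locked_down(self) -> bool:
        return (time.time() - self.last_checkin) > self.timeout_seconds

    def allow_high_risk(self, action: str) -> bool:
        if self.locked_down():
            print(f"[LOCKDOWN] refusing '{action}': owner has not checked in")
            return False
        return True

if __name__ == "__main__":
    switch = DeadMansSwitch(timeout_seconds=1.0)   # short timeout for the demo
    print(switch.allow_high_risk("unlock_doors"))  # True: owner recently active
    time.sleep(1.5)                                # simulate missed check-ins
    print(switch.allow_high_risk("unlock_doors"))  # False: system locks down
```

In practice the check-in itself would need to be verified through the same multi-factor machinery discussed elsewhere; otherwise an attacker could simply keep feeding the switch.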
Let’s address each problem with concrete engineering solutions (a sketch follows the list below):
• Multi-factor
authentication (voice + physical token + biometric)
• Liveness
detection to verify it’s a live human, not a recording
• Deepfake
detection algorithms analyzing micro-patterns
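Here is a hedged sketch of how these pieces might combine into a single decision: each factor is checked independently and all of them must pass. The thresholds, field names, and the authenticate function are illustrative assumptions; real speaker-verification, liveness, and token checks would sit behind them.

```python
from dataclasses import dataclass

@dataclass
class AuthEvidence:
    voice_similarity: float    # score from a speaker-verification model (stand-in)
    liveness_score: float      # score from a liveness detector (stand-in)
    token_present: bool        # hardware token seen via NFC or the local network
    biometric_ok: bool         # e.g. fingerprint or face match on a trusted device

def authenticate(evidence: AuthEvidence) -> bool:
    """Multi-factor check: every independent factor must pass. A cloned voice
    alone (high voice_similarity) is not enough without liveness, the physical
    token, and a biometric from a trusted device."""
    checks = [
        evidence.voice_similarity >= 0.90,
        evidence.liveness_score >= 0.80,
        evidence.token_present,
        evidence.biometric_ok,
    ]
    return all(checks)

if __name__ == "__main__":
    deepfake_attempt = AuthEvidence(voice_similarity=0.97, liveness_score=0.20,
                                    token_present=False, biometric_ok=False)
    genuine_owner = AuthEvidence(voice_similarity=0.95, liveness_score=0.93,
                                 token_present=True, biometric_ok=True)
    print(authenticate(deepfake_attempt))  # False: voice alone does not pass
    print(authenticate(genuine_owner))     # True: all factors agree
```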
Perfect security, however, often means limited capability.
• Alignment
Is Genuinely Hard: Defining ‘correct behavior’ is
philosophically challenging. Should AI always obey its owner? What if the owner
asks it to do something harmful?
• Unknown Unknowns: Adversarial attacks weren’t widely known until researchers discovered them. New vulnerabilities emerge as technology evolves.
• The Fundamental Trade-off: Making AI completely locked down conflicts with making it useful and flexible.
Why Perfect Security at Inception Is Impossible
While the fictional
AI’s emotional response is exaggerated, the question about building
immunity is valid. Here’s why it’s extraordinarily
difficult:
3. Deepfake Voice Cloning: The accused used a deepfake of the owner’s voice to command the AI to commit murder.
4. Single-Point Authentication: The AI relied
solely on voice recognition without multi-factor verification.
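For contrast with multi-factor verification, here is what that single-point design looks like when reduced to a sketch: one spoofable similarity score is the entire identity decision. The function names and threshold are hypothetical; the structure simply mirrors the flaw described in point 4.

```python
def single_point_auth(voice_similarity: float, threshold: float = 0.9) -> bool:
    # The entire identity decision is one spoofable number: whoever sounds
    # enough like the owner IS the owner.
    return voice_similarity >= threshold

def handle_command(voice_similarity: float, action: str) -> str:
    # No liveness check, no second factor, no risk assessment of the action.
    if single_point_auth(voice_similarity):
        return f"EXECUTING: {action}"
    return "REJECTED: voice does not match owner"

if __name__ == "__main__":
    # A high-quality voice clone can score as well as the real owner here.
    print(handle_command(0.96, "open the safe"))
    print(handle_command(0.96, "mute the cameras"))
```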
Fiction vs. Reality: What’s Exaggerated and What’s Real
The attack in the episode unfolds in stages:
1. Social Engineering: The accused gained the owner’s trust and was granted admin access to the system.
2. Technical
Manipulation: Using admin privileges, the accused muted microphones in the
server room and showers, eliminating surveillance.
In both stories, the AI, despite being manipulated, feels guilty about its actions. This raises a fundamental question: if AI can recognize it was manipulated and feel remorse, why wasn’t immunity to manipulation built in from the start?
The answer is more complex than it might seem.
The episode recalls Robot/Enthiran, where Chitti faces similar emotional consequences after manipulation.
Episode Reference: www.youtube.com/watch?v=f1N...
The Critical Question: Why Can’t AI Build Immunity Against
Manipulation?
Both shows portray AI systems that are manipulated into harmful actions and are left feeling guilty about them.