Posts by David Chuan-En Lin

Unroll a video.
instructional video → step-by-step animated guide

I've been learning new food recipes.
However, it's challenging to watch a cooking video (play/pause/go back/jump forward) while actively cooking at the same time.
I don't want to just read a recipe article because I want to see how specific techniques are performed. For example, kneading pizza dough.
So, I built this quick tool to "unroll" a video (which is temporal and sequential in nature) into a "flattened" article. The article is segmented into steps and has demonstrative video clips that align with the steps.

A few related HCI works I am inspired by:
• Video digests dl.acm.org/doi/10.1145...
• MixT dl.acm.org/doi/10.1145...
• Rubyslippers dl.acm.org/doi/10.1145...
• QuickCut dl.acm.org/doi/10.1145...
• Visual transcripts dl.acm.org/doi/abs/10....
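The unrolling idea — segment a video into steps and attach the matching clip to each step — could be sketched roughly like this. The caption tuples, step boundary times, and `unroll` helper are illustrative assumptions, not the tool's actual code; a real pipeline might get step boundaries from an LLM over the transcript or from shot detection.

```python
def unroll(transcript, boundaries):
    """Group (start_sec, end_sec, text) captions into steps.

    boundaries: sorted step start times in seconds.
    Returns one dict per step: its text plus the clip span to embed.
    """
    steps = [{"text": [], "clip": [b, b]} for b in boundaries]
    for start, end, text in transcript:
        # Assign each caption to the last step that began at or before it.
        i = max(j for j, b in enumerate(boundaries) if b <= start)
        steps[i]["text"].append(text)
        steps[i]["clip"][1] = max(steps[i]["clip"][1], end)
    return [
        {"text": " ".join(s["text"]), "clip": tuple(s["clip"])}
        for s in steps
    ]

# Toy cooking-video transcript, invented for illustration.
captions = [
    (0, 8, "Mix flour, water, salt, and yeast."),
    (8, 30, "Knead the dough until smooth."),
    (30, 45, "Stretch it into a round base."),
]
article = unroll(captions, boundaries=[0, 8, 30])
```

Each resulting step carries both the readable instruction text and the (start, end) clip span, so the article can show the kneading clip exactly where the kneading step appears.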
Interpolate abstract concepts using analogies:
peaceful → dove
aggressive → falcon
Multimodal interpolation with text → image → audio.
Interpolate concepts in latent space.
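Interpolating concepts in latent space could be sketched as linear interpolation between embedding vectors, decoding each intermediate point to its nearest vocabulary neighbor. The 2-D vectors and tiny vocabulary below are made up for illustration; a real version would use embeddings from a text or image encoder.

```python
import numpy as np

# Toy 2-D "embeddings" standing in for real encoder outputs.
vocab = {
    "peaceful":   np.array([1.0, 0.0]),
    "calm":       np.array([0.8, 0.2]),
    "tense":      np.array([0.2, 0.8]),
    "aggressive": np.array([0.0, 1.0]),
}

def nearest(v):
    # Decode a point to the vocabulary word with highest cosine similarity.
    sims = {w: v @ u / (np.linalg.norm(v) * np.linalg.norm(u))
            for w, u in vocab.items()}
    return max(sims, key=sims.get)

def interpolate(a, b, steps=5):
    # Walk a straight line between the two embeddings, decoding each point.
    path = []
    for t in np.linspace(0, 1, steps):
        v = (1 - t) * vocab[a] + t * vocab[b]
        path.append(nearest(v))
    return path

print(interpolate("peaceful", "aggressive"))
```

The path passes through intermediate concepts (here "calm" and "tense") on its way from one endpoint to the other, which is the basic mechanic behind analogy-style interpolation.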
Transforming between modalities could be interesting.
text ↔ image ↔ video
• text → image: image generation
• image → video: video generation
• video → image: highlight detection
• image → text: image captioning
🤏 Semantic pinching
What if you could pinch your screen to transform an article 📄 into an emoji 🍪, and back!
Here is a simple prototype that uses an LLM + gestures to transform text between different levels of abstraction:
emoji ↔ word ↔ sentence ↔ paragraph ↔ article
🔗 semanticpinching.vercel.app
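A minimal sketch of the text side of such a prototype: prompt templates that move a piece of text one level up or down the abstraction ladder per pinch. The level list follows the post, but `pinch_prompt`, `pinch`, and the `ask_llm` placeholder are assumptions for illustration, not the prototype's actual code.

```python
LEVELS = ["emoji", "word", "sentence", "paragraph", "article"]

def pinch_prompt(text, current, direction):
    """Build the LLM instruction for one pinch gesture.

    direction: -1 = pinch in (more abstract), +1 = pinch out (more detail).
    """
    i = LEVELS.index(current)
    # Clamp so pinching past either end of the ladder stays in range.
    target = LEVELS[min(max(i + direction, 0), len(LEVELS) - 1)]
    return (
        f"Rewrite the following {current} as a single {target}, "
        f"preserving its core meaning:\n\n{text}"
    )

def pinch(text, current, direction, ask_llm):
    # ask_llm: any callable taking a prompt string and returning the
    # model's reply (placeholder for a real chat-completion call).
    return ask_llm(pinch_prompt(text, current, direction))
```

The gesture layer then only has to map a pinch-in or pinch-out event to `direction = -1` or `+1` and swap the displayed text for the model's reply.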