Advertisement · 728 × 90

Posts by Diptanu Choudhury

If you are building agents, would love to have a conversation. While many companies in this space focuses on developer experience, we are also focused on performance and consistency beyond the DX!

1 week ago 2 0 1 0

I need to post here more often!

We have been working on sandboxes for agents @tensorlake.ai and have built the fastest sandbox infrastructure that is out there. Fastest file system, consistent sub second startup times and stateful resume and suspend of sandboxes for durable agents!

1 week ago 3 0 1 0

Job update: a couple of weeks ago, I joined @tensorlake.ai full time. I’m having a lot of fun building the product with @diptanu.bsky.social and the rest of this wonderful team.

We have a few open positions if you’d like to work with us: www.linkedin.com/jobs/search/...

7 months ago 8 4 1 1

Some more color on this. What does it mean to help models solve interesting problems with MCP?

MCP is the easiest way to give models access to the gazzilion APIs enterprises have already built.

yes, it’s a security nightmare but nevertheless it helps teams with the Day 1 problems of 🚢 MVPs

7 months ago 3 0 1 0

@chris.blue your post is spot on about the technical merits(or lack of) of MCP. Just sharing perspectives of people I have talked to :)

7 months ago 1 0 0 0

I was a skeptic until I heard banks and hedge funds using MCP to enable models solve interesting business problems. And yes OpenAPI + tool calls would have just worked fine but developers like frameworks and abstractions for building the first MVP. MCP being more prescriptive thank OpenAPI helps.

7 months ago 3 0 1 1

One of our customers told me today the biggest lift for structured extraction with @tensorlake is that their engineering team can now tweak the schema they want to extract from documents every week as they evolve their insurance platform.

These little things make AI find roots in enterprises.

9 months ago 2 0 0 0
🚀 Tensorlake Cloud is Here: Document Ingestion and Orchestration for AI Workflows
🚀 Tensorlake Cloud is Here: Document Ingestion and Orchestration for AI Workflows Tired of brittle pipelines and unreliable document parsing? Tensorlake is the only platform built for mission-critical document ingestion and orchestration i...

Announcing Tensorlake Cloud

Up-leveling Document Ingestion and Workflows for building agentic applications and complex business workflows.

www.youtube.com/shorts/OCv-...

11 months ago 8 3 1 2

This would be a problem if you used the data in an agentic application for a real estate business.

This is why Table Structure Understanding models are still the most reliable way to parse tables in business critical applications.

1 year ago 2 0 0 0
Advertisement
Post image

Here's an instance where Gemini Flash 2 misinterpreted a table, sourced from a residential property's mold report. It mistakenly attributed mold data from outside the property to a bedroom, as it failed to parse some rows from the third column

1 year ago 3 0 3 0

The emergence of browser agents makes me feel we are lacking a google for APIs. Instead developers are making LLMs search for things, and turning websites into APIs by making LLMs click on the UI like humans. This approach is going to take a long time to get right, and wasteful for a lot of reasons

1 year ago 3 0 1 0
Post image

It works pretty well! I will create an open source repo, with some code which ingests all these PDFs and creates datasets for people to work :)

1 year ago 2 0 0 0

. @tensorlake.bsky.social is pretty good! Here is our Document Ingestion API working on a scanned page from the recently released Kennedy assassination classified reports.

Head over to tensorlake.ai for more! (The product is still in early preview)

1 year ago 6 1 0 0

Test prompts instantly with our intuitive JSON schema editor and extract complex tables, figures, and charts with ease.

Once your schema is set, run our API at scale—tens of thousands of times per day—ingesting new documents and writing structured data directly into your database or warehouse.

1 year ago 4 0 0 0
Video

Structured Extraction is essential for AI engineering teams, we are now making it faster and more reliable than ever, whether you're turning PDFs, invoices, or reports into structured data.
Here is a sneak peak into our Structured Extraction engine.

1 year ago 12 2 1 0

I don’t live in New York but will come out just for that :)

1 year ago 7 0 0 0

Does Dagster have a way of packaging workflows into artifacts along with dependencies? Looks like in your screenshot you are importing requests. Do you build some docker images out of band and make sure the workflow functions run on those images?

1 year ago 2 0 0 0
Advertisement

This means we won’t need a dynamodb table anymore right?

1 year ago 4 0 1 0

Meant creating an artifact which includes the system and python dependencies. I have seen some tools copying code into a container with system dependencies, which others pickling the code and putting it into a container.

1 year ago 0 0 2 0

Python Folks - which data/workflow engine has the best developer experience for packaging code? We have looked into - Modal, Beam, Airflow, Flyte, AWS Lambda, Prefect, Dagster and Spark. Haven’t seen any approach which is fast, reliable and intuitive.

1 year ago 10 2 6 0

Taking a break for 10 days for the first time since December last year! January is going to be great and you will hear about @tensorlake.bsky.social more often :)

1 year ago 6 0 0 0

We have been using O1 or Sonnet to solve a problem to understand the upper bounds of what models are capable of, and falling back to our internal models or open source models for economy and security. Been working pretty well, is this a common workflow?

1 year ago 4 1 0 0

I know man. I poured my heart out on nomad back in the day. It was a ton of hard work. Thanks for appreciating :)

1 year ago 1 0 1 0
Post image

At the Hasicorp ReInvent party, no mention of Nomad and Consul 😭

1 year ago 5 0 1 0

Landed in Vegas for reinvent! Say hello if you are around, would love to chat :)

1 year ago 2 0 0 0

Turned on Apple Intelligence this morning. We are a long way from having a personal assistant on the iPhone!

It wish it summarized all unread from Slack, Gmail, WhatsApp and messages and came up with a list of things I needed to respond :)

1 year ago 3 0 1 0

This is why you see Anthropic and OpenAI go up the stack and integrating document sources and building agentic frameworks and building a better chat/workspace product.

1 year ago 0 0 0 0
Advertisement

Alibaba has done an amazing job with open source models. At this point, the difference between @Alibaba_Qwen and closed vendors is just the product on top of models.

1 year ago 1 0 1 0

Assuming you have a great document segmentation model (which is a big assumption) it's going to do the job better than anything else out there.

1 year ago 2 0 0 0

Qwen2VL 72B is just better than every other closed and open source vision model for document understanding.

Like every other vision model, it's still incapable of retaining every single ground truth on dense documents.

1 year ago 3 0 1 0