Discussion about this post

Chris Kornaros

I think you’re just describing the natural evolution of the Data Engineering role. The field has dealt with unstructured data for a while now; what’s newer is using LLMs to parse/document it. To me, that’s just adopting a new tool, in the same way Airflow is more recent than Cron.

In my experience, a lot of data engineering roles want or require experience with unstructured data or NoSQL. Even if you aren’t working with video at Netflix scale, dealing with PDFs is fundamentally similar (though clearly not the same).

Arvind Patil

Very engaging post!
