Discussion about this post

Chris Kornaros

I think you’re just describing the natural evolution of the Data Engineering role. The field has dealt with unstructured data for a while now; it’s just that using LLMs to parse/document is newer. To me, that’s just the adoption of a new tool, in the same way that Airflow is more recent than cron.

In my experience, a lot of data engineering roles want or require unstructured/NoSQL experience. Even if you aren’t working with video at Netflix’s scale, dealing with PDFs is fundamentally similar (though clearly not the same).

Arvind Patil

Very engaging post!
