Arrow Flight is more than “just a file format.” In this quick walkthrough, I break down what Flight is, why gRPC makes it perfect for streaming Arrow record batches, and how the core RPC methods work (ListFlights + DoGet).
Then we build a tiny Python Flight server that:
Lists Parquet files in a public S3 bucket
Uses Flight methods to return schemas and stream data back to a local client
If you want a fast mental model for Flight, plus a practical “hello world” you can adapt for real services, this is it.
🔗 Useful Links:
Read more about the Apache Arrow ecosystem HERE
Hi, my name is Hoyt. I’ve spent different lives in Marketing, Data Science and Data Product Management. Other than this Substack, I am the founder of Early Signal. I help data tech startups build authentic connections with technical audiences through bespoke technical content and intentional distribution. Are you an early stage start up or solopreneur wanting to get creative with your technical content and distribution strategy? Let’s talk!









