Data Engineering

!dataengineering

@lemm.ee
Create post
Citus Data - Distributed Postgres

Citus Data - Distributed Postgres

Open link in next tab

Citus Data | Distributed Postgres. At any scale.

https://www.citusdata.com/

Citus gives you all the greatness of Postgres plus the superpowers of distributed tables. By distributing your data and queries, your application gets high performance—at any scale. The Citus database is available as open source and as a managed service with Azure Cosmos DB for PostgreSQL.

Citus Data | Distributed Postgres. At any scale.
Cloud Backed SQLite

Cloud Backed SQLite

Open link in next tab

Cloud Backed SQLite: Cloud Backed SQLite

https://sqlite.org/cloudsqlite/doc/trunk/www/index.wiki

Apache Arrow
Open link in next tab

Apache Arrow

https://arrow.apache.org/

A cross-language development platform for in-memory analytics

Apache Arrow
Data-Oriented Design (2018)

Data-Oriented Design (2018)

Open link in next tab

https://www.dataorienteddesign.com/dodbook/dodmain.html

CAP Theorem Simplified

CAP Theorem Simplified

Open link in next tab

CAP Theorem Simplified

https://youtu.be/BHqjEjzAicA

Subscribe to our weekly system design newsletter: https://bit.ly/3tfAlYDCheckout our bestselling System Design Interview books: Volume 1: https://amzn.to/3Ou...

Introducing English as the New Programming Language for Apache Spark

Introducing English as the New Programming Language for Apache Spark

Open link in next tab

Introducing English as the New Programming Language for Apache Spark

https://www.databricks.com/blog/introducing-english-new-programming-language-apache-spark

Introduction

Introducing English as the New Programming Language for Apache Spark
What is Data Lineage?

What is Data Lineage?

Open link in next tab

Data Lineage: The Unseen Lifeline of Data-Driven Organizations | Airbyte

https://airbyte.com/blog/what-is-data-lineage

Data Lineage: The Unseen Lifeline of Data-Driven Organizations | Airbyte
Design Thinking Bootleg (Stanford)

Design Thinking Bootleg (Stanford)

Open link in next tab

Design Thinking Bootleg — Stanford d.school

https://dschool.stanford.edu/resources/design-thinking-bootleg

The Design Thinking Bootleg is a set of tools and methods that we keep in our back pockets, and now you can do the same.

Design Thinking Bootleg  — Stanford d.school
Array programming with NumPy

Array programming with NumPy

Open link in next tab

Array programming with NumPy - Nature

https://www.nature.com/articles/s41586-020-2649-2

NumPy is the primary array programming library for Python; here its fundamental concepts are reviewed and its evolution into a flexible interoperability layer between increasingly specialized computational libraries is discussed.

Array programming with NumPy - Nature
The Missing Semester of Your CS Education

The Missing Semester of Your CS Education

Open link in next tab

The Missing Semester of Your CS Education

https://missing.csail.mit.edu/