InGenerative AIbyFabio ChiusanoBuilding a Knowledge Base from Texts: a Full Practical ExampleImplementing a pipeline for extracting a Knowledge Base from texts or online articlesMay 24, 202210May 24, 202210
Cory MaklinThe Star Schema Is Dead Long Live Wide TablesIf you do a quick search for books data engineer must read, you’ll find two books: 1. Designing Data-Intensive Applications 2. The Data…May 29, 202315May 29, 202315
InTDS ArchivebyDr. Robert KüblerHands-On Deep Q-LearningLevel up your agent to win more difficult games!Nov 25, 20233Nov 25, 20233
InShelf EngineeringbyDmytro HarazdovskiyHow did we make Postgres perform reads 2x faster?It was easy to manage data up to 10–20 million rows in one table but after 50🍋 you face new challenges. Queries usually were doing up to….Nov 16, 202317Nov 16, 202317
Ankush SinghHow To: Understand Apache ArrowIn the world of Big Data and data science, the need for efficient, high-performance data processing frameworks is more crucial than ever…Jun 4, 2023Jun 4, 2023
InOrdinaryIndustriesbyJack FieldsThe Ultimate VS Code Setup for PythonHaving a well-optimized setup for your Visual Studio Code (VS Code) is nothing short of a game-changer when it comes to unleashing your…Jul 28, 202310Jul 28, 202310
DataBeansZ-ordering: take the Guesswork out“With great power comes great responsibility” -SpidermanSep 19, 20221Sep 19, 20221
Gurpreet SinghExploring Key Distributed System Algorithms and Concepts Series: 8— Quad tree and Geohash.Quad treeOct 1, 2023Oct 1, 2023
Analytics at MetaData engineering at Meta: High-Level Overview of the internal tech stackThis article provides an overview of the internal tech stack that we use on a daily basis as data engineers at Meta. The idea is to shed…Oct 10, 202331Oct 10, 202331
InPython in Plain EnglishbySerop BaghdadlianFive Python Decorators That Can Reduce Your Code By HalfUpgrade your Python game by using these wrappers for maximum efficiency and readability.May 29, 202320May 29, 202320
InCreative DatabyPatrick PichlerDistributed Query Engines vs. Data Lake EnginesThe evolution from SQL-based query engines for big data to data lake engines including its impact on data warehouses and data lakesNov 24, 2020Nov 24, 2020