Data Science Tech Brief By HackerNoon

Category: Tech News

Subscribers: 4
Reviews: 0
Episodes: 99

Description

Learn the latest data science updates in the tech world.

Episodes (title, then date)
How We Built a Per-Plant CO2 Dataset for 4,551 Power Stations Worldwide
Jun 25, 2026
Eliminating Data Latency with Event-Driven Pipelines at Enterprise Scale
Jun 25, 2026
Scaling Self-Service Analytics in Regulated Banking With Metadata-Driven Design
Jun 23, 2026
How to Rotate Proxies Without Breaking Login Sessions
Jun 23, 2026
I Built an Open-Source Firebase Analytics Alternative Because I Hit 1M Events/Day Once Too Many
Jun 20, 2026
Your Redshift Cluster Is Probably Idle 85% of the Time — And You're Paying for All of It
Jun 20, 2026
What the Real Operating Data on AI Agents Tells Me as an Investor
Jun 18, 2026
Building Data Quality Into the Pipeline Instead of Cleaning Up After It
Jun 17, 2026
Why Speed Matters: How Performance in Analytics Saves Business from "Digital Paralysis"
Jun 17, 2026
Open Data Is Not a Product. Here's What It Takes to Make It One.
Jun 12, 2026
Why Scrapers Fail: Headers, Sessions, IP Reputation, and Request Patterns
Jun 11, 2026
I Built an AI-Assisted Data Quality Layer for Operations Dashboards
Jun 03, 2026
The Source Code Isn't Hidden - You Just Gotta Refocus Your Lens
Jun 03, 2026
Why Your Data Governance Framework Is Failing (And What You Can Do About It)
Jun 02, 2026
The Cloud Data Leak: Architecting SQL to Stop Financial Bleeding
Jun 02, 2026
Principal Components Analysis in TypeScript (Part 4): Turning PCA Into Interpretable Factor Analysis
May 30, 2026
Data Engineering Teams Need a Different Version of Agile
May 28, 2026
The LLM Veneer: When AI Sounds Smart but Has Nothing Real to Reason Over
May 27, 2026
Bad Ingestion Architecture Generates Million Dollar Snowflake and Databricks Bills
May 22, 2026
Optimizing Distributed Data Processing for ML at Scale
May 21, 2026
Why Finance Data Quality Needs Rule Engines, Not ML Hype
May 21, 2026
156 Blog Posts To Learn About Business Intelligence
May 20, 2026
Why Your Marketplace Scraper Keeps Getting Blocked (And Why It’s Not a Code Problem)
May 19, 2026
How I Decoded My Apple Watch Metrics: Taking a Look At The Raw Numbers (Part 2)
May 09, 2026
Why AI Agents Are Creating a New Kind of Data Engineer
May 09, 2026
The Architectural Limits of Data Lakes and the Rise of Lakehouses
May 08, 2026
The Economic Case for Investing in Youth Education
May 07, 2026
HiveMQ and TimescaleDB: It Just Works!
May 07, 2026
102 Blog Posts To Learn About Datasets
May 06, 2026
Why More Data Doesn’t Guarantee Better Insights in Modern Data Systems
May 06, 2026
500 Blog Posts To Learn About Data
May 05, 2026
228 Blog Posts To Learn About Data Visualization
May 05, 2026
The Hard Lessons of Managing a Data Science Team
May 04, 2026
95 Blog Posts To Learn About Data Storage
May 04, 2026
70 Blog Posts To Learn About Data Scraping
May 03, 2026
500 Blog Posts To Learn About Data Science
May 03, 2026
110 Blog Posts To Learn About Data Management
May 02, 2026
402 Blog Posts To Learn About Data Analytics
May 01, 2026
50 Blog Posts To Learn About Data Collection
May 01, 2026
427 Blog Posts To Learn About Data Analysis
Apr 30, 2026
Your Dashboard Isn’t Wrong - Your KPI Logic Is
Apr 29, 2026
The Hidden Cost of Scraping Everything (and Why Datasets Win)
Apr 28, 2026
500 Blog Posts To Learn About Big Data
Apr 28, 2026
263 Blog Posts To Learn About Analytics
Apr 27, 2026
They Got Lost in the Transformer, Episode 1: What Even Is an Embedding?
Apr 24, 2026
Kafka vs Azure Event Hubs: The Tradeoffs You Only See in Production
Apr 24, 2026
Clarifying the Difference Between Data Strategy, Analytics, and AI Governance
Feb 06, 2026
The “Store Everything” Cloud Model Is Breaking Under Modern AI Workloads
Feb 06, 2026
AI Belongs Inside DataOps, Not Just at the End of the Pipeline
Feb 05, 2026
Stop Torturing Your Data: How to Automate Rigor With AI
Feb 04, 2026
Minimum Incident Lineage (MIL): A Run-Level Evidence Standard for Reproducible Data Incidents
Feb 04, 2026
5 Ways Spark 4.1 Moves Data Engineering From Manual Pipelines to Intent-Driven Design
Feb 03, 2026
Beyond Prediction: Econometric Data Science for Measuring True Business Impact
Feb 03, 2026
Designing Economic Intelligence: Econometrics-First Approaches in Data Science
Jan 31, 2026
From Forecasting to BI: Inside Shravanthi Ashwin Kumar’s Data-Driven Finance Playbook
Jan 30, 2026
Causal Thinking in the Age of Big Data: Modern Econometrics for Data Scientists
Jan 27, 2026
Data Pipeline Testing: The 3 Levels Most Teams Miss
Jan 27, 2026
HSM: The Original Tiering Engine Behind Mainframes, Cloud, and S3
Jan 25, 2026
Navigating Architectural Trade-offs at Scale to Meet AI Goals in 2026
Jan 23, 2026
Will AI Take Your Job? The Data Tells a Very Different Story
Jan 23, 2026
You Don’t Need an API for Everything (Sometimes Scraping Is Enough)
Jan 22, 2026
How to Use Propensity Score Matching to Measure Down Stream Causal Impact of an Event
Jan 22, 2026
How to Analyze Call Sentiment With Open-Source NLP Libraries
Jan 21, 2026
How Bayesian Tail-Risk Modeling can save your Retail Business Marketing Budget
Jan 20, 2026
Architecting Trustworthy Healthcare Data Platforms Using Declarative Pipelines
Jan 20, 2026
When A/B Tests Aren’t Possible, Causal Inference Can Still Measure Marketing Impact
Jan 14, 2026
Why Data Quality Is Becoming a Core Developer Experience Metric
Jan 13, 2026
Why “Accuracy” Fails for Uplift Models (and What to Use Instead)
Jan 11, 2026
Turning Your Data Swamp into Gold: A Developer’s Guide to NLP on Legacy Logs
Dec 18, 2025
Data Monetization Strategies in Government Digital Platforms
Dec 17, 2025
Why Partner Data Became My Toughest Engineering Problem
Dec 16, 2025
PBIX Is Not Going Away - But PowerBI Will Never Work the Same Again
Dec 16, 2025
Smart Fire Protection: How AI Is Changing Preventive Maintenance Forever
Dec 06, 2025
Why More VARs and SIs Are Embedding Melissa Into Their Enterprise Solutions
Dec 06, 2025
Big Data as the New Compass of Competition
Dec 04, 2025
Srilatha Samala’s Agile Intelligence Approach to Enterprise Reporting as a Strategic Asset
Dec 03, 2025
The Hidden Cost of Bad Data: Why It’s Undermining Your AI Strategy
Dec 03, 2025
Data Platform as a Service: A Three-Pillar Model for Scaling Enterprise Data Systems
Nov 20, 2025
How RAG Improves Database Management
Nov 20, 2025
How To Power AI, Analytics, and Microservices Using the Same Data
Nov 19, 2025
From Data Fragmentation to Billion-Dollar Insights: The Vision of Manish Ravindra Sharath
Oct 30, 2025
Building a Layered Defense Against Web Scraping
Oct 30, 2025
Cosmo: The Graph Visualization Tool Built for Your Terminal
Oct 23, 2025
How Businesses Are Turning Space Data into a Tool for Risk, Resilience, and Sustainability
Oct 15, 2025
How Data Innovation Changed a State’s Infrastructure Engine
Oct 10, 2025
How to Optimize Your Marketing Budget Using Just Three Letters: MMM
Sep 25, 2025
Here's How ShareChat Scaled Their ML Feature Store 1000X Without Scaling the Database
Sep 25, 2025
Why You Shouldn’t Judge by PnL Alone
Sep 24, 2025
From "Decentralized" to "Unified": SUPCON Uses SeaTunnel to Build an Efficient Data Collection Frame
Sep 23, 2025
Enterprise Data Pipeline Revolution: Suresh Palli's Metadata-Driven Automation Success
Sep 19, 2025
Unified Data, Smarter Agents—Is Your Architecture Future-Proof?
Sep 18, 2025
Data-Driven Decisions at Scale: A/B Testing Best Practices for Engineering & Data Science Teams
Sep 18, 2025
Why You Should (Almost) Always Choose Sync Gunicorn Workers
Sep 17, 2025
Beyond the Ten Blue Links: How Generative AI Rewires Our Brains for Search
Sep 16, 2025
Need Web Data? Here Are the 3 Methods Everyone’s Using
Sep 16, 2025
Applying Transitive Closure to Sort Products Into Categories, Considering Nesting and Overlaps
Sep 15, 2025
98% of Data Strategies Fail: Let's Fix It
Aug 02, 2024
How To Measure The Results Of In-App Events When Onelinks Don’t Work
Jul 30, 2024
How AI-Powered Data Mapping is Democratizing Data Management
Jul 27, 2024
Data Engineering: What’s the Value of API Security in the Generative AI Era?
Jul 27, 2024