Source

Towards Data Science

242 posts from this source

Sort:

1 month ago · ai · - · -

Optimizing Vector Search: Why You Should Flatten Structured Data

An analysis of how flattening structured data can boost precision and recall by up to 20% The post Optimizing Vector Search: Why You Should Flatten Structured D...

#vector search #data flattening #structured data #precision #recall #embeddings #similarity search
1 month ago · ai · - · -

RoPE, Clearly Explained

Going beyond the math to build intuition The post RoPE, Clearly Explained appeared first on Towards Data Science....

#RoPE #rotary positional embedding #transformers #positional encoding #LLM #deep learning
1 month ago · ai · - · -

The Unbearable Lightness of Coding

Confessions of a vibe coder The post The Unbearable Lightness of Coding appeared first on Towards Data Science....

#ai #data-science #tutorial
1 month ago · ai · - · -

Randomization Works in Experiments, Even Without Balance

Randomization usually balances confounders in experiments, but what happens when it doesn't? The post Randomization Works in Experiments, Even Without Balance a...

#ai #data-science #tutorial
1 month ago · ai · - · -

Going Beyond the Context Window: Recursive Language Models in Action

Explore a practical approach to analysing massive datasets with LLMs The post Going Beyond the Context Window: Recursive Language Models in Action appeared firs...

#large-language-models #context-window #recursive-models #data-analysis #machine-learning
1 month ago · education · - · -

Data Science as Engineering: Foundations, Education, and Professional Identity

Recognize data science as an engineering practice and structure education accordingly. The post Data Science as Engineering: Foundations, Education, and Profess...

#data science #engineering #education #professional identity #foundations
1 month ago · ai · - · -

From Connections to Meaning: Why Heterogeneous Graph Transformers (HGT) Change Demand Forecasting

How relationship-aware graphs turn connected forecasts into operational insight The post From Connections to Meaning: Why Heterogeneous Graph Transformers HGT C...

#heterogeneous graph transformer #graph neural networks #demand forecasting #machine learning #time series analysis
1 month ago · software · - · -

Layered Architecture for Building Readable, Robust, and Extensible Apps

If adding a feature feels like open-heart surgery on your codebase, the problem isn’t bugs, it’s structure. This article shows how better architecture reduces r...

#software-architecture #layered-architecture #clean-code #design-patterns #app-development
1 month ago · ai · - · -

You Probably Don’t Need a Vector Database for Your RAG — Yet

Numpy or SciKit-Learn might meet all your retrieval needs The post You Probably Don’t Need a Vector Database for Your RAG — Yet appeared first on Towards Data S...

#vector database #RAG #retrieval augmented generation #embeddings #LLM #machine learning #data science
1 month ago · software · - · -

Why Package Installs Are Slow (And How to Fix It)

How sharded indexing patterns solve a scaling problem in package management The post Why Package Installs Are Slow And How to Fix It appeared first on Towards D...

#package-management #installation-performance #dependency-resolution #software-tools
1 month ago · ai · - · -

Bridging the Gap Between Research and Readability with Marco Hening Tallarico

Diluting complex research, spotting silent data leaks, and why the best way to learn is often backwards. The post Bridging the Gap Between Research and Readabil...

#research communication #readability #data science #science writing #knowledge translation
1 month ago · ai · - · -

Using Local LLMs to Discover High-Performance Algorithms

How I used open-source models to explore new frontiers in efficient code generation, using my MacBook and local LLMs. The post Using Local LLMs to Discover High...

#ai #data-science #tutorial
1 month ago · ai · - · -

Time Series Isn’t Enough: How Graph Neural Networks Change Demand Forecasting

Why modeling SKUs as a network reveals what traditional forecasts miss The post Time Series Isn’t Enough: How Graph Neural Networks Change Demand Forecasting ap...

#graph neural networks #demand forecasting #time series #supply chain analytics #machine learning #deep learning
1 month ago · ai · - · -

The Hidden Opportunity in AI Workflow Automation with n8n for Low-Tech Companies

How to use n8n with multimodal AI and optimisation tools to help companies with low data maturity accelerate their digital transformation. The post The Hidden O...

#ai #data-science #tutorial
1 month ago · ai · - · -

Why Healthcare Leads in Knowledge Graphs

How science, regulation, collaboration, and public funding shaped the world’s most mature semantic infrastructure The post Why Healthcare Leads in Knowledge Gra...

#knowledge graphs #healthcare data #semantic web #ontologies #AI infrastructure #medical AI #data integration
1 month ago · ai · - · -

Data Poisoning in Machine Learning: Why and How People Manipulate Training Data

Do you know where your data has been? The post Data Poisoning in Machine Learning: Why and How People Manipulate Training Data appeared first on Towards Data Sc...

#data poisoning #machine learning security #adversarial attacks #training data manipulation #AI safety
1 month ago · ai · - · -

A Geometric Method to Spot Hallucinations Without an LLM Judge

Imagine a flock of birds in flight. There’s no leader. No central command. Each bird aligns with its neighbors—matching direction, adjusting speed, maintaining...

#hallucination detection #LLM evaluation #geometric method #AI safety #natural language processing
1 month ago · software · - · -

Maximum-Effiency Coding Setup

Learn how to be a more efficient programmer The post Maximum-Effiency Coding Setup appeared first on Towards Data Science....

#programming productivity #coding setup #developer tools #efficiency #workflow
1 month ago · ai · - · -

Cutting LLM Memory by 84%: A Deep Dive into Fused Kernels

Why your final LLM layer is OOMing and how to fix it with a custom Triton kernel. The post Cutting LLM Memory by 84%: A Deep Dive into Fused Kernels appeared fi...

#LLM #memory optimization #fused kernels #Triton #GPU performance #deep learning #model inference
1 month ago · ai · - · -

From RGB to Lab: Addressing Color Artifacts in AI Image Compositing

A multi-tier approach to segmentation, color correction, and domain-specific enhancement The post From RGB to Lab: Addressing Color Artifacts in AI Image Compos...

#image compositing #color correction #RGB #Lab color space #segmentation #computer vision #deep learning #AI image processing
1 month ago · ai · - · -

The Great Data Closure: Why Databricks and Snowflake Are Hitting Their Ceiling

Acquisitions, venture, and an increasingly competitive landscape all point to a market ceiling The post The Great Data Closure: Why Databricks and Snowflake Are...

#ai #data-science #tutorial
1 month ago · ai · - · -

When Shapley Values Break: A Guide to Robust Model Explainability

Shapley Values are one of the most common methods for explainability, yet they can be misleading. Discover how to overcome these limitations to achieve better i...

#shapley-values #model-explainability #interpretability #machine-learning #robustness
1 month ago · ai · - · -

How to Run Coding Agents in Parallel

Get the most out of Claude Code The post How to Run Coding Agents in Parallel appeared first on Towards Data Science....

#coding agents #parallel execution #Claude #LLM automation #AI programming
1 month ago · software · - · -

The 2026 Goal Tracker: How I Built a Data-Driven Vision Board Using Python, Streamlit, and Neon

Designing a centralized system to track daily habits and long-term goals The post The 2026 Goal Tracker: How I Built a Data-Driven Vision Board Using Python, St...

#python #streamlit #neon #data-visualization #habit-tracking #goal-tracking #web-app #vision-board

Newer posts

Older posts