data science

Methods to Construct an AI-Powered Weather ETL Pipeline with Databricks and GPT-4o: From API To Dashboard

, Databricks has shaken the information market once more. The corporate launched its free edition of the Databricks platform It's an incredible resource for learning and testing, to say the least. With that in mind,...

The Machine Learning “Advent Calendar” Day 24: Transformers for Text in Excel

of my Machine Learning Advent Calendar. Before closing this series, I would really like to sincerely thank everyone who followed it, shared feedback, and supported it, specifically the Towards Data Science team. Ending this calendar...

4 Techniques to Optimize AI Coding Efficiency

In a previous article, I of an important techniques I utilize to code effectively with AI agents. In this text, I’m continuing with 4 additional techniques, all of which I exploit each day. I imagine...

Is Your Model Time-Blind? The Case for Cyclical Feature Encoding

: The Midnight Paradox Imagine this. You’re constructing a model to predict electricity demand or taxi pickups. So, you feed it time (corresponding to minutes) starting at midnight. Clean and easy. Right? Now your model sees...

Bonferroni vs. Benjamini-Hochberg: Selecting Your P-Value Correction

be a sensitive topic. Perhaps best avoided on first encounter with a Statistician. The disposition toward the subject has led to a tacit agreement that α = 0.05 is the gold standard—in fact,...

The Machine Learning “Advent Calendar” Day 23: CNN in Excel

were first introduced for images, and for images they are sometimes easy to know. A filter slides over pixels and detects edges, shapes, or textures. You possibly can read this text I wrote earlier...

Stop Retraining Blindly: Use PSI to Construct a Smarter Monitoring Pipeline

, cleaned the information, made a number of transformations, modeled it, after which deployed your model to be utilized by the client.  That’s a whole lot of work for an information scientist. However the job...

The Machine Learning “Advent Calendar” Day 20: Gradient Boosted Linear Regression in Excel

, we ensemble learning with voting, bagging and Random Forest. Voting itself is simply an aggregation mechanism. It doesn't create diversity, but combines predictions from already different models.Bagging, however, explicitly creates diversity by training...

Recent posts

Popular categories

ASK ANA