Latest Posts

188 posts from all your feeds

7 AI Tools I Can’t Live Without as a Professional Data Scientist

A professional data scientist shares 7 essential AI tools that have revolutionized his workflow, covering everything from writing and research to coding, data analysis, and local model experimentation.

www.kdnuggets.com

Nov 27, 2025•3 months ago•8 min read

A Complete Guide to Seaborn

This comprehensive guide details intermediate to advanced Seaborn usage for Python data visualization. It covers setting themes, creating various plot types (relational, categorical, distribution, regression), using grid layouts, visualizing correlations, and fine-tuning plots with Matplotlib hooks for clear, accurate communication.

www.kdnuggets.com

Oct 8, 2025•4 months ago•11 min read

Statistics at the Command Line for Beginner Data Scientists

You don’t need Python or R to start working with data. This guide walks you through using built-in Unix utilities for real statistical analysis.

www.kdnuggets.com

Dec 8, 2025•2 months ago

10 Polars One-Liners for Speeding Up Data Workflows

This article presents 10 concise Polars one-liners to accelerate data manipulation workflows. It covers key operations like lazy loading, filtering, and aggregation, positioning Polars as a high-performance alternative to Pandas.

www.kdnuggets.com

Nov 6, 2025•3 months ago•4 min read

7 Best Chrome Extensions for Agentic AI

This article showcases 7 agentic AI Chrome extensions, contrasting them with reactive AI. It details tools like Magical, Merlin, Zapier Agents, Recall, BrowserAgent, Taskade AI, and Perplexity AI for automating workflows, enhancing research, and boosting browser productivity.

www.kdnuggets.com

Oct 22, 2025•4 months ago•5 min read

Building a Gmail Inbox Management Agent in n8n

Learn to build an automated Gmail inbox management agent using n8n. This agent scores incoming emails by sender, content, and category, then routes them to priority-specific actions like labeling, creating tasks, or sending Slack alerts.

www.kdnuggets.com

Nov 11, 2025•3 months ago•7 min read

Building AI Automations with Google Opal

Google Opal is a new, experimental no-code tool that translates natural language prompts into visual AI workflows. It empowers both technical and non-technical users to build and share custom AI micro-applications using Google's models.

www.kdnuggets.com

Nov 14, 2025•3 months ago•7 min read

The Data Detox: Training Yourself for the Messy, Noisy, Real World

In this article, we’ll use a real-life data project to explore four practical steps for preparing to deal with messy, real-life datasets.

www.kdnuggets.com

Dec 15, 2025•2 months ago

Free AI and Data Courses with 365 Data Science— 100% Unlimited Access until Nov 21

365 Data Science offers free, unlimited access to its entire AI and data science learning platform from Nov 6-21, 2025. This initiative provides courses, hands-on projects, and industry-recognized certifications for aspiring tech professionals.

www.kdnuggets.com

Nov 6, 2025•3 months ago•2 min read

5 Practical Examples for ChatGPT Agents

This article explores 5 practical examples of ChatGPT Agents moving beyond conversation to real-world action. It covers automating data cleaning, managing AI customer support, streamlining content production, building research assistants, and orchestrating DevOps. These agents integrate with APIs to automate complex workflows, enhancing efficiency across various sectors.

www.kdnuggets.com

Oct 17, 2025•4 months ago•5 min read

5 Top AI-Powered App Builders

Take a tour of 5 of the most popular AI-powered app builders out there to leverage automation in the process of building software.

www.kdnuggets.com

Dec 18, 2025•2 months ago

5 Fun Data Science Projects for Absolute Beginners

Learn data science by doing! This post highlights 5 video tutorials for beginners, covering the complete workflow: data cleaning, exploratory analysis, visualization with Plotly, feature engineering, and deploying a model with Streamlit.

www.kdnuggets.com

Nov 3, 2025•3 months ago•3 min read

5 Fun Docker Projects for Absolute Beginners

Learn Docker by doing with five beginner-friendly projects covering hosting, multi-container apps, CI, and monitoring.

www.kdnuggets.com

Dec 26, 2025•2 months ago

The Best Web Scraping APIs for AI Models in 2026

This post compares top web scraping APIs (Bright Data, Oxylabs, ScraperAPI, Apify) essential for AI models in 2026. It evaluates them on dynamic content support, scalability, anti-bot, structured output, and integration, identifying Bright Data as the top choice.

www.kdnuggets.com

Dec 7, 2025•2 months ago•3 min read

The Algorithmic X-Men

This article creatively explains seven essential machine learning algorithms by comparing their strengths and weaknesses to iconic X-Men characters, from Wolverine as a Decision Tree to Jean Grey as a Neural Network.

www.kdnuggets.com

Sep 29, 2025•5 months ago•4 min read

How I Built a Data Cleaning Pipeline Using One Messy DoorDash Dataset

This article details building a data cleaning pipeline for a messy DoorDash dataset. It covers data exploration, fixing datetime types, imputing missing `store_primary_category` values via a smart `mode()`-based strategy, and dropping remaining NaNs for analysis.

www.kdnuggets.com

Oct 16, 2025•4 months ago•4 min read

Prompt Engineering for Outlier Detection

Learn how to detect outliers by doing a real-life data project and improve the process with AI.

www.kdnuggets.com

Dec 9, 2025•2 months ago

TPOT: Automating ML Pipelines with Genetic Algorithms in Python

TPOT (Tree-based Pipeline Optimization Tool) automates machine learning pipeline creation in Python using genetic algorithms. It simplifies tasks like data cleaning, algorithm selection, and hyperparameter tuning by evolving optimal pipelines over generations, saving significant time.

www.kdnuggets.com

Dec 9, 2025•2 months ago•3 min read

The 5 FREE Must-Read Books for Every AI Engineer

This article highlights five essential, free resources for AI engineers. The recommendations span foundational texts on neural networks, a comprehensive deep learning book, a practical course, a book on agent-based AI, and a paper on ethics.

www.kdnuggets.com

Nov 12, 2025•3 months ago•4 min read

7 Steps to Mastering Data Storytelling for Business Impact

Master data storytelling with a 7-step guide. This workflow transforms complex analysis into clear, actionable business insights by defining questions, knowing your audience, choosing metrics, and crafting a compelling narrative.

www.kdnuggets.com

Nov 10, 2025•3 months ago•3 min read

Decoding Agentic AI: The Rise of Autonomous Systems

Agentic AI represents the next AI frontier: autonomous systems capable of planning, acting, and self-improving. Unlike static LLMs, these agents use modular design (planning, memory, tools) and an "observe, plan, act, reflect" cycle for continuous learning and problem-solving.

www.kdnuggets.com

Nov 18, 2025•3 months ago•3 min read

Data Analytics Automation Scripts with SQL Stored Procedures

Learn to automate data analytics tasks by encapsulating complex SQL queries into reusable stored procedures. This guide shows how to create a procedure in MySQL and call it from a Python script for streamlined workflows.

www.kdnuggets.com

Oct 14, 2025•4 months ago•3 min read

Why Do Language Models Hallucinate?

Language model hallucinations are not a mysterious flaw but a direct result of training and evaluation methods that reward confident guessing over admitting uncertainty. The issue is rooted in simple classification errors.

www.kdnuggets.com

Sep 24, 2025•5 months ago•4 min read

5 Useful Python Scripts for Busy Data Engineers

This article provides five practical Python scripts for data engineers to automate common operational tasks: monitoring pipeline health, validating schemas, tracking data lineage, analyzing database performance, and ensuring data quality.

www.kdnuggets.com

Nov 14, 2025•3 months ago•4 min read

5 Docker Containers for Language Model Development

This article details five Docker container setups (NVIDIA CUDA, PyTorch, Hugging Face, Jupyter, llama.cpp/Ollama) designed to streamline language model development. They offer isolated, reproducible environments to accelerate LLM research, fine-tuning, and inference.

www.kdnuggets.com

Nov 24, 2025•3 months ago•7 min read

How to Build Production-Ready UI Prototypes in Minutes Using Google Stitch

Google Stitch is a new AI tool that generates production-ready UI prototypes from simple text or image prompts. This guide shows how to quickly create, iterate, and export designs, accelerating app development.

www.kdnuggets.com

Sep 16, 2025•5 months ago•3 min read

Building a Simple Data Quality DSL in Python

This post demonstrates building a simple Python DSL for data quality validation using Pandas. It focuses on creating readable, maintainable rules, separating business logic from error handling, offering reusable components and extensibility.

www.kdnuggets.com

Dec 1, 2025•2 months ago•8 min read

5 Strategic Steps to a Seamless AI Integration

This article provides a five-step strategic roadmap for businesses to successfully integrate AI. It emphasizes starting with a clear purpose, building a strong data foundation, training staff, scaling smartly, and embedding ethics.

www.kdnuggets.com

Sep 16, 2025•5 months ago•4 min read

5 Cutting-Edge MLOps Techniques to Watch in 2026

Explore 5 cutting-edge MLOps trends for 2026: Policy-as-Code for governance, AgentOps for AI agents, operational explainability, distributed MLOps for edge/TinyML/federated systems, and Green MLOps for sustainability. These trends are vital for scaling AI responsibly.

www.kdnuggets.com

Dec 1, 2025•3 months ago•4 min read

5 AI-Assisted Coding Techniques Guaranteed to Save You Time

Enhance coding productivity with 5 AI-assisted techniques: context-rich prompting, dual-AI code & review, automated testing, legacy refactoring, and parallel task execution. AI handles repetitive work, letting developers focus on architecture and creativity.

www.kdnuggets.com

Oct 24, 2025•4 months ago•9 min read

From Dataset to DataFrame to Deployed: Your First Project with Pandas & Scikit-learn

This article guides beginners through building a machine learning regression model in Python using Pandas and Scikit-learn. It covers loading and cleaning a dataset, handling missing values, creating preprocessing pipelines, training a Random Forest model to predict employee income, evaluating its performance, and saving the model for future deployment.

www.kdnuggets.com

Nov 7, 2025•3 months ago•4 min read

Mapping the AI Education Surge: Which States and Schools Are Leading the Pack in 2025

A 2025 report reveals a massive surge in US AI degree programs, with a 167% increase in Master's degrees since 2022. The South, led by Texas, now leads the nation, signaling a decentralization of AI education beyond tech hubs.

www.kdnuggets.com

Nov 10, 2025•3 months ago•2 min read

Automating Web Search Data Collection for AI Models with SerpApi

This sponsored post highlights SerpApi, a tool that automates data collection from search engines. It solves web scraping challenges by providing a simple API that returns structured JSON data for use in AI models and analytics.

www.kdnuggets.com

Nov 5, 2025•3 months ago•4 min read

How To Set Business Goals You’ll Actually Reach (Sponsored)

What you need is a system that supports goal-setting within a structure that turns broad ambitions into concrete, achievable targets. This article provides a simple three-step framework for doing so.

www.kdnuggets.com

Oct 21, 2025•4 months ago

5 Emerging Trends in Data Engineering for 2026

Looking ahead to 2026, the most impactful trends are not flashy frameworks but structural changes in how data pipelines are designed, owned, and operated.

www.kdnuggets.com

Dec 23, 2025•2 months ago

What To Look For In A Cloud Services Provider (Sponsored)

Choosing a cloud services provider can feel a lot like dating: every vendor promises reliability, security, and support, but only a few truly live up to it. The wrong choice can lead to costly downtime, security headaches, or performance bottlenecks that ripple across your business.

bit.ly

Jan 6, 2026•1 month ago

Unlock Business Value: Build a Data & Analytics Strategy That Delivers

Gartner's guide helps data leaders build business-driven D&A strategies, moving beyond vague "data-driven" goals. It introduces the DASOM framework and actionable steps to align D&A with business objectives, ensuring tangible value delivery.

www.kdnuggets.com

Nov 19, 2025•3 months ago•1 min read

Processing Large Datasets with Dask and Scikit-learn

Learn to efficiently process large datasets and build ML models using Dask and scikit-learn. This guide covers loading, cleaning, and transforming data with Dask's lazy computation, seamlessly integrating with scikit-learn for scalable workflows.

www.kdnuggets.com

Nov 13, 2025•3 months ago•4 min read

Exploring Metaclasses in Python: Unleashing the Power of Class Creation

An introduction to Python metaclasses for data scientists. This post explains how metaclasses act as blueprints for classes, controlling their creation process, much like classes are blueprints for objects.

www.kdnuggets.com

Sep 30, 2025•5 months ago•3 min read

Debunking 5 Myths About Cloud Computing for Small Business (Sponsored)

www.kdnuggets.com

Oct 1, 2025•5 months ago

Deploy an AI Analyst in Minutes: Connect Any LLM to Any Data Source with Bag of Words

Learn to deploy an AI analyst in minutes with Bag of Words, an AI data layer that connects any LLM to SQL databases. This guide walks through a simple Docker setup to turn your data into interactive, AI-powered insights.

www.kdnuggets.com

Nov 25, 2025•3 months ago•4 min read

The Lazy Data Scientist’s Guide to Exploratory Data Analysis

This guide promotes an efficient, or "lazy," approach to Exploratory Data Analysis (EDA) using Python automation tools like ydata-profiling and Sweetviz to handle repetitive tasks, saving time for deeper, domain-specific analysis.

www.kdnuggets.com

Oct 7, 2025•4 months ago•3 min read

ChatLLM. An Honest Review of Our All-in-One AI Platform

ChatLLM by Abacus.AI is an all-in-one platform offering access to major AI models like GPT, Claude, and Gemini for a low monthly fee. It provides document analysis, coding, and image generation but has some notable drawbacks.

www.kdnuggets.com

Nov 7, 2025•3 months ago•4 min read

What Does the End of GIL Mean for Python?

Python's Global Interpreter Lock (GIL) is being made optional via PEP 703, enabling true multithreading. This promises major performance boosts for CPU-bound tasks but requires developers to manage new concurrency complexities.

www.kdnuggets.com

Nov 10, 2025•3 months ago•4 min read

5 Practical Docker Configurations

Enhance Docker efficiency with 5 key configurations: optimize caching with strategic layering, use multi-stage builds for lean & secure images, securely manage environment variables, streamline networking & volumes, and fine-tune resource allocation for predictable performance.

www.kdnuggets.com

Nov 28, 2025•3 months ago•5 min read

Is ChatGPT Study Mode a Hidden Gem or a Gimmick?

This article explores ChatGPT's Study Mode, a feature designed for personalized learning. It weighs the benefits like interactive quizzing against drawbacks such as potential inaccuracies and over-reliance to determine its true value.

www.kdnuggets.com

Oct 3, 2025•4 months ago•4 min read

5 Free Tools to Experiment with LLMs in Your Browser

Discover five free tools that let you run and test large language models directly in your browser without any setup.

www.kdnuggets.com

Dec 10, 2025•2 months ago

10 Lesser-Known Python Libraries Every Data Scientist Should Be Using in 2026

Want to level up your data science toolkit? Here are some Python libraries that'll make your work easier.

www.kdnuggets.com

Dec 31, 2025•2 months ago

Gistr: The Smart AI Notebook for Organizing Knowledge

This article explains how Gistr transforms the way data professionals interact with their most valuable asset: their accumulated knowledge.

www.kdnuggets.com

Dec 22, 2025•2 months ago

10 Command-Line Tools Every Data Scientist Should Know

This post highlights 10 essential command-line tools for data scientists, updated for 2025. It covers utilities for data retrieval, text processing, parallel execution, and version control to enhance productivity and efficiency.

www.kdnuggets.com

Oct 8, 2025•4 months ago•6 min read

10 Python One-Liners to Optimize Your Hugging Face Transformers Pipelines

Discover 10 Python one-liners to optimize Hugging Face Transformers pipelines. Enhance performance and efficiency with GPU acceleration, batching, half-precision, faster tokenization, truncation, and ensure reproducibility for robust NLP/LLM workflows.

www.kdnuggets.com

Sep 22, 2025•5 months ago•4 min read

The Lazy Data Scientist’s Guide to Time Series Forecasting

This guide champions an efficient, 'lazy' approach to time series forecasting. It shows how to use modern Python libraries like Prophet, AutoARIMA, and AutoML platforms to get accurate predictions quickly, avoiding tedious manual tuning.

www.kdnuggets.com

Sep 16, 2025•5 months ago•3 min read

Debunking 5 myths about cloud computing for small business (Sponsored)

usa.ingrammicro.com

Sep 29, 2025•5 months ago

Top 5 Text-to-Speech Open Source Models

This article reviews the top 5 open-source Text-to-Speech (TTS) models: VibeVoice, Orpheus, Kokoro, OpenAudio S1, and XTTS-v2. It highlights their unique features, from multi-speaker podcasts to zero-shot voice cloning.

www.kdnuggets.com

Oct 29, 2025•4 months ago•4 min read

Building Machine Learning Application with Django

A step-by-step tutorial on building a complete machine learning web application with Django. Learn to train a scikit-learn model, create a user-friendly web interface, and expose a JSON API for programmatic predictions.

www.kdnuggets.com

Sep 26, 2025•5 months ago•5 min read

5 Lightweight Alternatives to Pandas You Should Try

Get started with five free Python libraries that let you analyze, filter, and process data faster than traditional Pandas.

www.kdnuggets.com

Dec 12, 2025•2 months ago

Data Cleaning at the Command Line for Beginner Data Scientists

This article introduces beginner data scientists to powerful command-line tools (head, tail, wc, cut, sort, uniq, grep, sed, awk) for efficient data cleaning, transformation, and exploration of CSV files without requiring Python.

www.kdnuggets.com

Nov 20, 2025•3 months ago•7 min read

7 LinkedIn Tricks to Get Noticed by Recruiters

This article provides seven actionable tricks for professionals, particularly in data science, to optimize their LinkedIn profiles. It emphasizes using targeted keywords, showcasing projects, and strategic engagement to attract recruiters.

www.kdnuggets.com

Oct 6, 2025•4 months ago•3 min read

Agentic AI Coding with Google Jules

Google Jules is an asynchronous, agentic AI coding system that automates development tasks like bug fixes, updates, and testing. It integrates with GitHub, executes in secure cloud VMs, and provides transparent plans and diffs for review.

www.kdnuggets.com

Oct 27, 2025•4 months ago•8 min read

Creating a Text to SQL App with OpenAI + FastAPI + SQLite

This guide demonstrates building a Text-to-SQL application using OpenAI, FastAPI, and SQLite. It covers setting up the project, connecting to a database, using OpenAI's API for query generation, and containerizing the app with Docker.

www.kdnuggets.com

Oct 17, 2025•4 months ago•7 min read

10 Essential Agentic AI Interview Questions for AI Engineers

This guide provides 10 essential agentic AI interview questions for AI engineers, covering core concepts, tool integration, planning, multi-agent systems, and safety. It emphasizes practical experience, design choices, and understanding trade-offs.

www.kdnuggets.com

Oct 24, 2025•4 months ago•9 min read

7 ChatGPT Tricks to Automate Your Data Tasks

Discover how to transform ChatGPT into a powerful data assistant. Learn 7 tricks to automate tasks like SQL generation, data cleaning, Python scripting, and report writing, turning hours of grunt work into minutes.

www.kdnuggets.com

Dec 2, 2025•2 months ago•5 min read

How Data Engineering Can Power Manufacturing Industry Transformation

Data engineering is crucial for modern manufacturing, enabling the industry to harness vast data from IIoT and legacy systems. It helps build data pipelines for predictive maintenance, supply chain optimization, and operational efficiency.

www.kdnuggets.com

Nov 20, 2025•3 months ago•5 min read

5 NotebookLM Tips to Make Your Day a Little Easier

This article offers five practical tips for data scientists to leverage Google's NotebookLM. It covers clustering research, integrating external AI for fact-checking, generating outlines, maintaining dynamic documentation, and refining sources.

www.kdnuggets.com

Oct 13, 2025•4 months ago•4 min read

How To Get More Done Without Working More Hours (Sponsored)

Stop wasting time on repetitive tasks. Learn how smart business automation can boost your productivity, cut costs, and get you back hours a week.

www.kdnuggets.com

Oct 27, 2025•4 months ago

Collecting Real-Time Data with APIs: A Hands-On Guide Using Python

A hands-on guide to collecting real-time data using APIs in Python. Learn to use the `requests` and `pandas` libraries with practical examples, from a simple user generator to the complex Eurostat statistical data API.

www.kdnuggets.com

Oct 29, 2025•4 months ago•4 min read

The Benefits of an “Everything” Notebook in NotebookLM

This article details how data scientists can use Google's NotebookLM to create an "everything" notebook. This centralized knowledge base boosts productivity with semantic search, cross-document synthesis, and advanced querying.

www.kdnuggets.com

Nov 12, 2025•3 months ago•5 min read

How To Use Synthetic Data To Build a Portfolio Project

This guide demonstrates how to create synthetic data using random, rule-based, simulation, and AI methods. It provides a step-by-step walkthrough for building a complete portfolio project, from model training to a Streamlit app.

www.kdnuggets.com

Sep 22, 2025•5 months ago•8 min read

Top 5 Agentic Coding CLI Tools

This article reviews the top 5 agentic coding CLI tools: Claude Code, OpenCode, Droid, Codex CLI, and Gemini CLI. It shares personal experiences, pros, cons, and installation commands, highlighting tools for daily tasks, customization, debugging, and leveraging LLMs. Node.js is a common prerequisite.

www.kdnuggets.com

Nov 13, 2025•3 months ago•5 min read

Top 10 Free API Providers for Data Science Projects

This article presents a curated list of 10 free APIs essential for data science projects. It categorizes them for easy access to foundational datasets, web scraping, geospatial information, financial markets, and social media data.

www.kdnuggets.com

Sep 19, 2025•5 months ago•4 min read

The Complete Guide to Building Data Pipelines That Don’t Break

This guide details a systematic approach to building robust data pipelines, emphasizing software engineering principles. It covers validation, idempotency, schema evolution, backpressure, data quality monitoring, observability, and testing to prevent failures and reduce maintenance.

www.kdnuggets.com

Nov 11, 2025•3 months ago•7 min read

A Gentle Introduction to TypeScript for Python Programmers

An introductory guide for Python developers learning TypeScript. It compares key concepts like type systems, classes, and functions, highlighting how TypeScript provides robust, compile-time safety that prevents common Python runtime errors.

www.kdnuggets.com

Oct 6, 2025•4 months ago•6 min read

The 10 AI Developments That Defined 2025

In this article, we look back at what we consider the ten most consequential, broadly impactful AI storylines of 2025, and gain insight into where the field is going in 2026.

www.kdnuggets.com

Jan 6, 2026•1 month ago

7 Steps to Mastering Agentic AI

As AI systems begin handling more complex, multi-stage tasks, understanding agentic design is becoming essential. This article outlines seven practical steps to build reliable, effective AI agents.

www.kdnuggets.com

Dec 11, 2025•2 months ago

10 Newsletters for Busy Data Scientists

Discover 10 top free newsletters for busy data scientists to efficiently stay updated on machine learning, AI, statistics, and data engineering. Curated content helps cut through noise, offering practical insights and career advice.

www.kdnuggets.com

Sep 23, 2025•5 months ago•4 min read

A Gentle Introduction to MCP Servers and Clients

This post introduces the Model Context Protocol (MCP), a standard for AI systems to securely interact with external tools and data. It defines three roles: hosts (AI apps), clients (connectors inside the host), and servers (resource wrappers) for reusable integration.

www.kdnuggets.com

Oct 2, 2025•5 months ago•4 min read

Pixi: A Smarter Way to Manage Python Environments

Pixi revolutionizes Python environment management, solving reproducibility and portability challenges. It offers isolated, consistent project setups, streamlines CI/CD, and accelerates onboarding, ideal for data science, ML, and web development.

www.kdnuggets.com

Dec 5, 2025•2 months ago•5 min read

The 5 FREE Must-Read Books for Every LLM Engineer

This article curates a list of 5 free, essential books for LLM engineers. The recommendations cover diverse, crucial topics including foundational theory, NLP, system scaling on TPUs, model interpretability, and cybersecurity risks.

www.kdnuggets.com

Nov 5, 2025•3 months ago•5 min read

The 5 FREE Must-Read Books for Every Data Scientist

This article curates a list of five essential and free online books for aspiring data scientists. It emphasizes moving beyond just coding to build a deep understanding of statistical theory, algorithms, and practical applications.

www.kdnuggets.com

Nov 18, 2025•3 months ago•4 min read

Prompt Engineering Templates That Work: 7 Copy-Paste Recipes for LLMs

This article offers 7 practical, copy-paste prompt engineering templates for LLMs across diverse tasks, including job applications, coding, creative writing, and business strategy, emphasizing structured and guided inputs for superior outputs.

www.kdnuggets.com

Oct 9, 2025•4 months ago•4 min read

Are AI Browsers Any Good? A Day with Perplexity’s Comet and OpenAI’s Atlas

A hands-on review of AI browsers Perplexity Comet and OpenAI Atlas. While excellent for research and summarizing static content, they fail with dynamic web apps and introduce significant privacy and security concerns.

www.kdnuggets.com

Nov 26, 2025•3 months ago•6 min read

How I Prepared for a Data Science Interview at a Large Tech Company

A product data scientist shares their interview preparation plan for a large tech company, emphasizing skills that differentiate a product role from traditional data science: advanced SQL, applied statistics, and business acumen.

www.kdnuggets.com

Nov 5, 2025•3 months ago•4 min read

The KDnuggets Gradio Crash Course

Build ML web apps in minutes with Gradio's Python framework. Create interactive demos for models with text, image, or audio inputs with no frontend skills needed. Deploy and share instantly.

www.kdnuggets.com

Jan 6, 2026•1 month ago

Time Series and Trend Analysis Challenge Inspired by Real World Datasets

This post analyzes 5 years of 10-Year Breakeven Inflation Rate data using three complementary time series techniques: moving averages, year-over-year changes, and Bollinger Bands. It demonstrates how each method provides distinct insights into market inflation expectations, revealing underlying trends, momentum shifts, and volatility periods, emphasizing the importance of choosing the right analytical lens.

www.kdnuggets.com

Dec 3, 2025•2 months ago•10 min read

10 GitHub Repositories to Master Vibe Coding

Vibe coding is the new AI-driven approach to software development, using context-aware agent systems instead of simple Q&A. This article presents 10 GitHub repositories covering context engineering, tools, workflows, security, and more, to help developers master this advanced methodology.

www.kdnuggets.com

Dec 3, 2025•2 months ago•5 min read

Building Full Stack Apps with Firebase Studio

An introduction to Firebase Studio, Google's cloud-based IDE. It integrates Firebase services and Gemini AI to streamline full-stack app development, enabling rapid prototyping and deployment with zero local setup.

www.kdnuggets.com

Nov 7, 2025•3 months ago•9 min read

5 Cutting-Edge AutoML Techniques to Watch in 2026

This article discusses five cutting-edge AutoML techniques and trends that are expected to shape highly automated machine learning model building in 2026.

www.kdnuggets.com

Dec 9, 2025•2 months ago

From Excel to Python: 7 Steps Analysts Can Take Today

A 7-step guide for data analysts to transition their skills from Excel to Python. It covers mapping existing knowledge, learning fundamentals with libraries like Pandas, practicing on real data, and integrating Python with Excel.

www.kdnuggets.com

Oct 1, 2025•5 months ago•4 min read

From Hustle to Structure: How to Build Repeatable Processes in Your Business (Sponsored)

Transition from reactive hustle to proactive structure by building simple, repeatable processes. If you are looking for practical ways to get started, this article has you covered.

www.kdnuggets.com

Nov 11, 2025•3 months ago

The Best Agentic AI Browsers to Look For in 2026

A quick look at the top 7 agentic AI browsers that can search the web for you, fill forms automatically, handle research, draft content, and streamline your entire workflow.

www.kdnuggets.com

Dec 29, 2025•2 months ago

How to Build and Publish a Docker Image to Docker Hub

A step-by-step guide to containerizing a Python Flask app. Learn to write a Dockerfile, build a Docker image, test it locally, and publish it to Docker Hub, making your application portable and easily shareable.

www.kdnuggets.com

Sep 25, 2025•5 months ago•4 min read

Emergent Introspective Awareness in Large Language Models

This post reviews research on 'Emergent Introspective Awareness' in LLMs, examining if models can self-report internal states. Using concept injection on Claude models, early hints of introspection are found, vital for interpretability and addressing issues like hallucinations.

www.kdnuggets.com

Dec 4, 2025•2 months ago•3 min read

The Real Cost of Inaction: How Silos Hurt Productivity for Data Scientists (Sponsored)

The overarching goal is to maximize the return on analytical talent, shifting their focus entirely from data preparation to predictive model development, which is a necessary move if the business intends to compete in an AI-driven economy.

bit.ly

Dec 17, 2025•2 months ago

5 Useful Python Scripts to Automate Boring Everyday Tasks

Spending too much time on repetitive tasks? These Python scripts will help you automate the mundane stuff that drains your productivity.

www.kdnuggets.com

Dec 19, 2025•2 months ago•

Make.com Automations for Saving Time as a Data Professional

This article introduces Make.com as a no-code visual platform for data professionals to automate repetitive tasks like data collection, cleaning, and reporting. It highlights how Make.com saves time, improves data accuracy, and integrates with AI, empowering professionals to focus on analysis rather than manual grunt work.

www.kdnuggets.com

Nov 24, 2025•3 months ago•9 min read

+9 more

How to Write Readable Python Functions Even If You’re a Beginner

This guide helps beginners write readable Python functions. It covers best practices like clear naming for functions and parameters, keeping functions concise, using docstrings, meaningful variable names, avoiding magic numbers, and implementing type hints for better code clarity.

www.kdnuggets.com

Nov 19, 2025•3 months ago•5 min read

+5 more

Hosting Language Models on a Budget

Learn how to run your own language model for free using lightweight models and Hugging Face Spaces.

www.kdnuggets.com

Dec 18, 2025•2 months ago•

An Introduction to Zapier Automations for Data Scientists

This post introduces Zapier automations for data scientists, explaining how to use no-code workflows to connect apps, automate repetitive tasks like data collection, reporting, and alerts, saving time, reducing errors, and boosting productivity.

www.kdnuggets.com

Nov 19, 2025•3 months ago•5 min read

+6 more

Finding Meaningful Work in the Age of Vibe Coding

Vibe coding has devalued coding. Is there any meaningful work still left for us?

www.kdnuggets.com

Dec 10, 2025•2 months ago•

Vibe Coding with GLM 4.6 Coding Plan

This article introduces Z.AI's GLM-4.6 coding model and its affordable ~$3/month subscription, the GLM Coding Plan. It provides a step-by-step tutorial on integrating it with OpenCode to build a complete website from a single prompt.

www.kdnuggets.com

Oct 20, 2025•4 months ago•4 min read

+3 more

Here’s When You Would Choose Spreadsheets Over SQL

Don't default to SQL for every data task. This post details when spreadsheets are the better choice: for small data, quick tasks, collaboration, visualization, and when working with non-technical teams.

www.kdnuggets.com

Oct 13, 2025•4 months ago•3 min read

+4 more

The Psychology of Bad Data Storytelling: Why People Misread Your Data

This post explores 10 psychological reasons why audiences misinterpret data, from cognitive biases to poor visualization. It provides actionable fixes for data storytellers to present information clearly and drive correct decisions.

www.kdnuggets.com

Oct 22, 2025•4 months ago•6 min read

+2 more

Getting Started with the Claude Agent SDK

Build programmable AI agents and CLI apps with Claude Agent SDK. This tutorial covers setup and developing "TrendSmith," a multi-tool app for market trend analysis, integrating web search, data fetching, and local storage.

www.kdnuggets.com

Nov 28, 2025•3 months ago•5 min read

+8 more

Cloud 101 for Business Owners (Sponsored)

If you've been hearing about "the cloud" for years but still aren't sure what it means for your business, we get it. Let's cut through the noise.

www.kdnuggets.com

Nov 4, 2025•3 months ago•

Qwen Code Leverages Qwen3 as a CLI Agentic Programming Tool

Qwen Code is a new agentic CLI programming tool powered by the Qwen3-Coder model. It enhances developer productivity by understanding codebases, suggesting optimizations, and automating tasks directly from the command line.

www.kdnuggets.com

Oct 1, 2025•5 months ago•4 min read

+5 more

The Best Local Coding LLMs You Can Run Yourself

This article explores the best local coding LLMs for developers, highlighting options like GLM-4, DeepSeekCoder V2, Qwen3-Coder, Codestral, and Code Llama. It details their features, performance, context windows, and hardware requirements.

www.kdnuggets.com

Sep 17, 2025•5 months ago•3 min read

+6 more

Top 7 Python Package Managers

This article explores 7 popular Python package managers: uv, pip, Poetry, Conda, Miniconda, Mamba, and Pixi. It details their features, installation, and ideal use cases, from beginner-friendly Anaconda to fast alternatives like uv and Mamba for data science.

www.kdnuggets.com

Oct 27, 2025•4 months ago•4 min read

+13 more

How Transformers Think: The Information Flow That Makes Language Models Work

Let's uncover how transformer models sitting behind LLMs analyze input information like user prompts and how they generate coherent, meaningful, and relevant output text "word by word".

www.kdnuggets.com

Dec 15, 2025•2 months ago•

Reinvent Customer Engagement with Dynamics 365: Turn Insights into Action

Enhance customer engagement with Microsoft Dynamics 365. This AI-powered platform unifies data across sales, marketing, and service, turning fragmented customer insights into actionable, personalized experiences.

www.kdnuggets.com

Oct 15, 2025•4 months ago•3 min read

+4 more

A 5-Step Guide to Tackling (Almost) Any Data Science Project

This guide outlines a 5-step framework for successful data science projects, prioritizing business alignment, data exploration, simple baselines, and robust feature engineering over premature optimization of complex models.

www.kdnuggets.com

Nov 18, 2025•3 months ago•6 min read

+4 more

What Is Cross-Validation? A Plain English Guide with Diagrams

This guide explains cross-validation, a technique for robustly evaluating machine learning models. It details why it's superior to a single train/test split and covers key methods like k-fold, stratified, and time-series CV.

www.kdnuggets.com

Oct 1, 2025•5 months ago•4 min read

+4 more

Top 5 Small AI Coding Models That You Can Run Locally

Explore the top 5 small AI coding models that run locally, offering privacy and cost savings. This guide details gpt-oss-20b, Qwen3-VL-32B-Instruct, Apriel-1.5-15B-Thinker, Seed-OSS-36B-Instruct, and Qwen3-30B-A3B-Instruct-2507, noting their strengths for diverse developer workflows.

www.kdnuggets.com

Dec 5, 2025•2 months ago•5 min read

+8 more

How to Vibe Code on a Budget

AI-powered 'Vibe Coding' can be costly. This guide reveals a budget-friendly workflow using cheap coding plans, free tools, open-source models, and free API providers to help you code smarter without breaking the bank.

www.kdnuggets.com

Dec 2, 2025•2 months ago•4 min read

+4 more

5 Fun AI Agent Projects for Absolute Beginners

Discover five fun, beginner-friendly AI agent projects, from a pure Python calendar assistant to advanced research bots. This guide provides curated video tutorials to help you build agents that can act, reason, and automate tasks.

www.kdnuggets.com

Oct 3, 2025•4 months ago•4 min read

+4 more

How I Actually Use Statistics as a Data Scientist

A data scientist shares that most roles require applied statistics for business problems, not deep academic theory. Focus on concepts like A/B testing, ML model interpretation, and descriptive analysis. Learn core concepts for interviews and advanced skills on the job.

www.kdnuggets.com

Oct 7, 2025•4 months ago•3 min read

+6 more

The 5 FREE Must-Read Books for Every Machine Learning Engineer

This article lists 5 free, must-read books for Machine Learning Engineers, emphasizing the importance of deep theoretical understanding over just practical application. It covers foundational math, core algorithms, statistical learning, pattern recognition, and building ML systems.

www.kdnuggets.com

Nov 25, 2025•3 months ago•6 min read

+7 more

SQL for Data Analysts: Essential Queries for Data Extraction & Transformation

A concise guide for data analysts to 15 essential SQL queries. It ranges from fundamental commands like SELECT, WHERE, and GROUP BY to more advanced features like JOINs, CASE, and window functions for effective data extraction and transformation.

www.kdnuggets.com

Oct 20, 2025•4 months ago•3 min read

+3 more

Pandas: Advanced GroupBy Techniques for Complex Aggregations

This guide explores advanced Pandas GroupBy techniques beyond simple sums and means. It covers the distinct uses of agg, transform, apply, and filter for complex scenarios like conditional logic, weighted metrics, and time-series analysis.

www.kdnuggets.com

Oct 21, 2025•4 months ago•5 min read

+3 more

Staying Ahead of AI in Your Career

To thrive in the age of AI, collaborate with it instead of resisting. This post advises developing adaptive intelligence, integrating AI into your workflow, and strengthening irreplaceable human skills like empathy and strategic thinking.

www.kdnuggets.com

Nov 27, 2025•3 months ago•4 min read

+3 more

How to Handle Large Datasets in Python Even If You’re a Beginner

You don’t need advanced skills to work with large datasets. With Python’s built-in features and libraries, you can handle large datasets without breaking a sweat even if you're a beginner.

www.kdnuggets.com

Dec 17, 2025•2 months ago•

7 Free Remote MCPs You Must Use As A Developer

This post highlights seven free remote Model Context Protocol (MCP) servers for developers. These tools, including integrations for GitHub, Figma, and Notion, connect AI assistants to essential services to streamline workflows.

www.kdnuggets.com

Oct 28, 2025•4 months ago•5 min read

+6 more

Facing The Threat of AIjacking

AIjacking, a new threat exploiting LLMs via prompt injection, allows AI agents to perform unauthorized actions like data exfiltration without human interaction, bypassing traditional security. The article outlines practical defenses and emphasizes a security-first AI approach.

www.kdnuggets.com

Oct 27, 2025•4 months ago•6 min read

+7 more

We Benchmarked DuckDB, SQLite, and Pandas on 1M Rows: Here’s What Happened

A performance benchmark of DuckDB, SQLite, and Pandas on a 1M row dataset. The test compared speed and memory usage for common data analysis tasks, revealing DuckDB's consistent high performance and balanced efficiency.

www.kdnuggets.com

Oct 10, 2025•4 months ago•5 min read

+4 more

7 Python Libraries Every Analytics Engineer Should Know

This article highlights 7 essential Python libraries for analytics engineers, covering data manipulation (Polars), quality (Great Expectations), transformation (dbt), orchestration (Prefect), and more to streamline workflows.

www.kdnuggets.com

Sep 23, 2025•5 months ago•5 min read

+5 more

7 Tiny AI Models for Raspberry Pi

This is a list of top LLMs and VLMs that are fast, smart, and small enough to run locally on devices as small as a Raspberry Pi or even a smart fridge.

www.kdnuggets.com

Dec 22, 2025•2 months ago•

Top 7 Open Source AI Coding Models You Are Missing Out On

This article introduces 7 top open-source AI coding models for local deployment, offering privacy, security, and cost savings over cloud-based tools. It details each model's strengths, architecture, and benchmarks, providing alternatives for sensitive codebases.

www.kdnuggets.com

Nov 21, 2025•3 months ago•7 min read

+15 more

Our favourite Black Friday deal to Learn SQL, AI, Python, and become a certified data analyst!

DataCamp offers a Black Friday deal (Nov 12-Dec 4) with 50% off full platform access, including 600+ courses, career tracks, and certifications. Learn SQL, Python for data analysis, or dive into AI engineering with tracks covering OpenAI, LLMs, and MLOps. Gain job-ready skills and industry certifications.

www.kdnuggets.com

Nov 25, 2025•3 months ago•4.3 min read

+10 more

5 Cutting-Edge Natural Language Processing Trends Shaping 2026

By 2026, NLP will evolve beyond current models, focusing on five key trends: efficient attention mechanisms, autonomous language agents, world models for reasoning, knowledge graphs for context, and on-device NLP for speed and privacy.

www.kdnuggets.com

Sep 24, 2025•5 months ago•4 min read

+5 more

The Complete Guide to Using Google AI Studio

A comprehensive guide to Google AI Studio, covering account setup, interface features, and model selection. Learn to prototype with Gemini, generate code and images, and build AI applications directly from the web-based workspace.

www.kdnuggets.com

Nov 3, 2025•3 months ago•8 min read

+5 more

Top SQL Patterns from FAANG Data Science Interviews (with Code)

Ace your FAANG data science interviews by mastering the top 5 SQL patterns. This guide covers aggregation, subqueries, window functions, moving averages, and conditional aggregation, complete with practical PostgreSQL examples.

www.kdnuggets.com

Nov 20, 2025•3 months ago•8 min read

+4 more

7 High Paying Side Hustles for Students

Make extra money between classes with beginner-friendly freelance platforms that fit your lifestyle.

www.kdnuggets.com

Dec 30, 2025•2 months ago•

Git for Vibe Coders

This beginner-friendly guide teaches AI "vibe coders" how to integrate Git and GitHub into their workflow to prevent data loss from AI tools like Claude Code. It covers essential commands for version control, saving snapshots, branching, and automatic backups.

www.kdnuggets.com

Nov 21, 2025•3 months ago•5 min read

+4 more

5 Useful Python Scripts for Busy Data Analysts

This article outlines five practical Python scripts designed to automate common, time-consuming tasks for data analysts, such as report formatting, data reconciliation, dashboard creation, and scheduled data pulls.

www.kdnuggets.com

Oct 21, 2025•4 months ago•3 min read

+4 more

5 Docker Containers for Your AI Infrastructure

This post highlights five essential Docker containers to build a robust AI infrastructure: JupyterLab for experimentation, Airflow for orchestration, MLflow for tracking, Redis for caching, and FastAPI for serving models.

www.kdnuggets.com

Oct 20, 2025•4 months ago•5 min read

+6 more

The Best Proxy Providers for Large-Scale Scraping for 2026

This article compares top proxy providers for large-scale web scraping in 2026, including Bright Data, Oxylabs, Infatica, and NetNut. It highlights their features, pricing, and ideal use cases, concluding Bright Data is superior.

www.kdnuggets.com

Nov 30, 2025•3 months ago•5 min read

+11 more

10 Useful Python One-Liners for CSV Processing

Discover 10 powerful Python one-liners for common CSV tasks. This guide shows how to sum, group, filter, and analyze CSV data efficiently using built-in modules, perfect for quick data exploration without external libraries like pandas.

www.kdnuggets.com

Oct 14, 2025•4 months ago•4 min read

+3 more

5 Fun NLP Projects for Absolute Beginners

This article outlines five hands-on Natural Language Processing (NLP) projects for beginners. It covers tokenization, Named Entity Recognition (NER), sentiment analysis, text generation, and machine translation with video tutorials.

www.kdnuggets.com

Nov 17, 2025•3 months ago•4 min read

+6 more

A Gentle Introduction to vLLM for Serving

An introduction to vLLM, an open-source serving engine that optimizes LLM inference. It uses a core innovation called PagedAttention to achieve high throughput, low latency, and efficient memory use for production applications.

www.kdnuggets.com

Sep 18, 2025•5 months ago•4 min read

+5 more

Generative AI Hype Check: Can It Really Transform SDLC?

This post explores Generative AI's impact on the Software Development Lifecycle (SDLC), weighing its productivity benefits against limitations like the need for human oversight, security risks, and struggles with complex, novel tasks.

www.kdnuggets.com

Oct 29, 2025•4 months ago•5 min read

+4 more

Prompt Engineering for Data Quality and Validation Checks

This article explores how prompt engineering with LLMs can revolutionize data validation, moving beyond static rules to intelligent reasoning. It covers designing effective prompts, embedding domain knowledge, and automating validation pipelines for robust data quality.

www.kdnuggets.com

Dec 18, 2025•2 months ago•5 min read

+7 more

Weights & Biases: A KDnuggets Crash Course

A practical crash course on using Weights & Biases (W&B) for MLOps. Learn to track experiments, version datasets and models with Artifacts, run hyperparameter sweeps, and improve reproducibility in your ML workflows.

www.kdnuggets.com

Oct 6, 2025•4 months ago•5 min read

+4 more

Airtable + GPT: Prototyping a Lightweight RAG System with No-Code Tools

This tutorial guides users through prototyping a lightweight RAG system using Airtable as the knowledge store, OpenAI's GPT models for generation, and Pipedream for no-code orchestration. It details setting up the workflow and offers both code-based and AI-agent methods for building it.

www.kdnuggets.com

Sep 17, 2025•5 months ago•5 min read

+7 more

Using NotebookLM to Tackle Tough Questions: Interview Smarter, Not Harder

This guide demonstrates how to use Google's NotebookLM to enhance technical interview preparation. It uses a Meta recommendation system problem to showcase how the AI tool creates summaries, quizzes, and visual aids to deepen understanding.

www.kdnuggets.com

Nov 6, 2025•3 months ago•5 min read

+4 more

The Beginner’s Guide to Tracking Token Usage in LLM Apps

This guide explains why tracking token usage in LLM apps is vital for cost and performance. It provides a step-by-step tutorial on using LangSmith with LangChain and Hugging Face to monitor and visualize token consumption.

www.kdnuggets.com

Oct 14, 2025•4 months ago•6 min read

+3 more

5 Signs Your Business Is Ready For AI (Sponsored)

How do you know if you're ready to take the AI plunge? Here are five dead giveaways that AI could transform how you work.

www.kdnuggets.com

Oct 14, 2025•4 months ago•

Top 7 ChatGPT Alternatives You Can Try For Free

This article explores seven powerful and free alternatives to ChatGPT for tasks like research, coding, and content creation. It highlights the unique features of Microsoft Copilot, Google Gemini, Grok, You.com, and others.

www.kdnuggets.com

Nov 11, 2025•3 months ago•6 min read

+2 more

5 Workflow Automation Tools for All Professionals

This article reviews five powerful workflow automation tools—n8n, Zapier, Make, Microsoft Power Automate, and ClickUp—designed for professionals to streamline repetitive digital tasks using features like AI integration and visual builders.

www.kdnuggets.com

Dec 16, 2025•2 months ago•5 min read

+9 more

The One Data Analyst Role That’s AI-Proof

And it pays $100K+ more than regular data analyst jobs.

www.kdnuggets.com

Dec 12, 2025•2 months ago•

The Hidden Curriculum of Data Science Interviews: What Companies Really Test

Beyond technical tests, data science interviews have a 'hidden curriculum.' This post reveals the non-technical skills companies truly evaluate, such as business translation, handling ambiguity, and understanding trade-offs.

www.kdnuggets.com

Oct 23, 2025•4 months ago•5 min read

+3 more

5 Data Privacy Stories from 2025 Every Analyst Should Know

In this article we look at 5 specific privacy stories from 2025 that changed how analysts work day to day, from the code they write to the reports they publish.

www.kdnuggets.com

Dec 17, 2025•2 months ago•

Probability Concepts You’ll Actually Use in Data Science

How can we reason with uncertainty and make smarter decisions from data? This article explains the key probability ideas in data science.

www.kdnuggets.com

Dec 23, 2025•2 months ago•

Emerging Trends in AI Ethics and Governance for 2026

In 2026, people want accountability frameworks that feel real, enforceable, and grounded in how AI behaves in live environments.

www.kdnuggets.com

Dec 15, 2025•2 months ago•

Beginner’s Guide to VibeVoice

This guide introduces VibeVoice, Microsoft's open-source, next-gen Text-to-Speech framework for expressive, multi-speaker audio. Learn to set up VibeVoice-1.5B on Google Colab, download the model, create transcripts, run inference, and troubleshoot common issues. It highlights VibeVoice's quality and open-source benefits.

www.kdnuggets.com

Sep 22, 2025•5 months ago•4 min read

+9 more

API Development for Web Apps and Data Products

A practical guide to API development for web apps and data products. Covers essential principles from user-centric design and RESTful practices to robust security, scaling strategies, and the importance of clear documentation.

www.kdnuggets.com

Oct 28, 2025•4 months ago•4 min read

+4 more

5 Critical Feature Engineering Mistakes That Kill Machine Learning Projects

This article outlines 5 critical feature engineering mistakes—data leakage, dimensionality trap, target encoding errors, outlier mismanagement, and model-feature mismatch—that often doom ML projects, offering practical solutions for building robust, deployable models.

www.kdnuggets.com

Dec 4, 2025•2 months ago•15 min read

+10 more

Top 5 Open-Source LLM Evaluation Platforms

If you’re building an LLM app, these open-source tools help you test, track, and improve your model’s performance easily.

www.kdnuggets.com

Dec 8, 2025•2 months ago•

Python for Data Science (Free 7-Day Mini-Course)

This post outlines a free, 7-day mini-course for beginners learning Python for data science. It covers fundamental skills like data structures, file I/O, string manipulation, and error handling using only core Python.

www.kdnuggets.com

Sep 29, 2025•5 months ago•11 min read

+4 more

What Is Big Tech’s Influence on AI Development?

Big Tech's massive investments accelerate AI development, but their dominance over cloud infrastructure and the AI supply chain raises serious concerns about stifling competition, market control, and significant ethical challenges.

www.kdnuggets.com

Sep 25, 2025•5 months ago•6 min read

+5 more

6 Docker Tricks to Simplify Your Data Science Reproducibility

Read these 6 tricks for treating your Docker container like a reproducible artifact, not a disposable wrapper.

www.kdnuggets.com

Jan 5, 2026•1 month ago•

10 Useful Python One-Liners for Data Engineering

Explore 10 practical Python one-liners for data engineering using pandas. This guide covers common tasks like parsing JSON, analyzing performance logs, detecting schema changes, monitoring APIs, and optimizing memory usage.

www.kdnuggets.com

Sep 25, 2025•5 months ago•4 min read

+3 more

Building Pure Python Web Apps with Reflex

This tutorial introduces Reflex, an open-source library for building scalable, full-stack web applications entirely in Python. It covers installation, core concepts like state management, and guides you through building a to-do list app.

www.kdnuggets.com

Oct 13, 2025•4 months ago•4 min read

+4 more

Nano Banana Practical Prompting & Usage Guide

This guide explores Google's "Nano Banana" (Gemini 2.5 Flash Image) AI image model. Learn its advanced features, like multi-image composition and semantic inpainting, with practical prompting strategies and usage tips.

www.kdnuggets.com

Sep 26, 2025•5 months ago•8 min read

+4 more

Top 7 Python ETL Tools for Data Engineering

Building data pipelines? These Python ETL tools will make your life easier.

www.kdnuggets.com

Jan 6, 2026•1 month ago•

5 Excel AI Lessons I Learned the Hard Way

This article shares 5 hard-won lessons for building trustworthy machine learning systems in Excel with XLMiner. It covers multi-method outlier detection, setting random seeds, 3-way data partitioning, monitoring overfitting, and implementing data validation.

www.kdnuggets.com

Nov 26, 2025•3 months ago•10 min read

+11 more

Beginner’s Guide to Creating Your Own Python Shell with the cmd Module

Learn to build a custom interactive command-line shell in Python using the built-in `cmd` module. This step-by-step guide covers creating commands, parsing arguments with `shlex`, adding a help system, and creating aliases.

www.kdnuggets.com

Sep 24, 2025•5 months ago•5 min read

+3 more

What’s on My Bookmarks Bar: Data Science Edition

A data scientist shares 10 essential bookmarks for staying updated and productive. The list covers everything from new research and trending code to unique datasets, quick visualization tools, and job listings.

www.kdnuggets.com

Nov 3, 2025•3 months ago•3 min read

+3 more

Beginner’s Guide to Data Analysis with Polars

Learn the fundamentals of data analysis with Polars, a high-performance Python library. This beginner-friendly guide uses a coffee shop dataset to walk you through installation, data creation, manipulation, and analysis.

www.kdnuggets.com

Sep 19, 2025•5 months ago•5 min read

+3 more

Top 7 Open Source OCR Models

Best OCR and vision language models you can run locally that transform documents, tables, and diagrams into flawless markdown copies with benchmark-crushing accuracy.

www.kdnuggets.com

Dec 24, 2025•2 months ago•

We Used 3 Feature Selection Techniques: This One Worked Best

This article compares three feature selection techniques—Filter, Wrapper (RFE), and Embedded (Lasso)—using the scikit-learn Diabetes dataset. The experiment concludes that the Embedded method, Lasso, offered the best performance.

www.kdnuggets.com

Oct 2, 2025•5 months ago•3 min read

+4 more

Accessing Data Commons with the New Python API Client

This article introduces Data Commons, Google's open-source initiative to organize public data via a knowledge graph. It details using the new Python API client to access datasets, covering API key setup, library installation, and fetching statistical variables and entities using DCIDs, with examples for Pandas DataFrames.

www.kdnuggets.com

Oct 16, 2025•4 months ago•5 min read

+7 more

5 Signs Your Business Is a Prime Target for Cyberattacks

This post highlights 5 signs your business is a prime cyberattack target: weak passwords, outdated software, untrained staff, poor backups, and no monitoring. It stresses that SMBs are heavily targeted and prevention is key.

www.kdnuggets.com

Oct 7, 2025•4 months ago•4 min read

+3 more

5 Tips for Building Useful Streamlit Dashboards in Minutes

This article provides five practical tips to build more efficient and useful Streamlit dashboards. It covers performance enhancement with caching, improving UX with input batching, state management, displaying KPIs, and extending features.

www.kdnuggets.com

Sep 18, 2025•5 months ago•3 min read

+4 more

7 Steps to Build a Simple RAG System from Scratch

A step-by-step guide to building a simple Retrieval-Augmented Generation (RAG) system from scratch. This tutorial walks through data prep, chunking, vector embeddings with FAISS, and answer generation using an open-source LLM.

www.kdnuggets.com

Nov 17, 2025•3 months ago•9 min read

+4 more

Why I Quit My 6 Figure Side Hustle for a Full-Time Data Science Job

A data scientist explains their decision to leave a six-figure freelance career for a lower-paying, full-time job. The choice prioritizes long-term career security, paid learning, and building skills less replaceable by AI.

www.kdnuggets.com

Sep 30, 2025•5 months ago•4 min read

+3 more

Lux + Pandas: Auto-Visualizations for Lazy Analysts

Lux is a Python library that enhances Pandas to automate exploratory data analysis. It automatically generates insightful visualizations when a DataFrame is displayed, saving analysts time on repetitive manual plotting tasks.

www.kdnuggets.com

Nov 24, 2025•3 months ago•4 min read

+3 more

5 Strategic Steps to a Seamless AI Integration

This guide provides a five-step framework for successful AI integration: defining the problem, building a strong data foundation, upskilling employees, starting with small pilots, and embedding ethical AI practices from the start.

www.kdnuggets.com

Sep 16, 2025•5 months ago•5 min read

+4 more

3 Unexpected Uses for NotebookLM

Discover 3 unexpected ways to use NotebookLM for advanced workflows. Learn to combine its power with other AI tools for website gap analysis, rigorous source verification, and transforming complex data into presentation-ready insights.

www.kdnuggets.com

Nov 20, 2025•3 months ago•6 min read

+7 more

My Honest Review on Abacus AI: ChatLLM, DeepAgent & Enterprise

This review praises Abacus AI for its exceptional value, offering access to 18+ top LLMs for just $10/month. It highlights key features like ChatLLM for teams, the autonomous DeepAgent, and a comprehensive MLOps enterprise platform.

www.kdnuggets.com

Nov 24, 2025•3 months ago•5 min read

+6 more

Context Engineering Explained in 3 Levels of Difficulty

Long-running LLM applications degrade when context is unmanaged. Context engineering turns the context window into a deliberate, optimized resource. Learn more in this article.

www.kdnuggets.com

Jan 5, 2026•1 month ago•

10 GitHub Repositories to Master Machine Learning Deployment

Discover 10 top GitHub repositories to master machine learning deployment, transforming models into products. This guide covers essential MLOps concepts, from packaging and API exposure to cloud deployment and building production-ready ML applications.

www.kdnuggets.com

Dec 11, 2025•2 months ago•4 min read

+11 more

Data Observability in Analytics: Tools, Techniques, and Why It Matters

Discover Data Observability, the process of monitoring data system health. This post breaks down its five pillars (freshness, volume, schema, distribution, lineage), its benefits, lifecycle, and key industry tools to ensure reliable analytics.

www.kdnuggets.com

Nov 4, 2025•3 months ago•6 min read

+3 more

Transform Raw Data Into Real Impact

Bay Path University's online Master's in Applied Data Science helps working professionals transform data into real impact. The program offers practical, hands-on learning, including Generative AI expertise, to advance careers.

www.kdnuggets.com

Nov 11, 2025•3 months ago•2 min read

+10 more

How to Write Efficient Python Data Classes

Writing efficient Python data classes cuts boilerplate while keeping your code clean. This article shows you how.

www.kdnuggets.com

Dec 12, 2025•2 months ago•

Beginner’s Guide to Data Extraction with LangExtract and LLMs

A beginner's guide to LangExtract, Google's open-source Python library. It leverages LLMs like Gemini and OpenAI to extract structured information from unstructured text using simple prompts and few-shot examples.

www.kdnuggets.com

Nov 4, 2025•3 months ago•3 min read

+5 more

Context Engineering is the New Prompt Engineering

Prompt engineering is obsolete. Context engineering, which builds intelligent environments with data, memory, and structure, now governs AI's consistency and depth. It's about designing worlds for models to think within, fostering co-intelligence and adaptive systems.

www.kdnuggets.com

Dec 1, 2025•3 months ago•5 min read

+7 more

Why model distillation is becoming the most important technique in production AI

Nebius Token Factory customers use distillation today for search ranking, grammar correction, summarization, chat quality improvement, code refinement, and dozens of other narrow tasks.

www.kdnuggets.com

Dec 9, 2025•2 months ago•

I Asked ChatGPT, Claude and DeepSeek to Build Tetris

Which of these state-of-the-art models writes the best code?

www.kdnuggets.com

Jan 5, 2026•1 month ago•


10 Python One-Liners to Optimize Your Hugging Face Transformers Pipelines

The Lazy Data Scientist’s Guide to Time Series Forecasting

Debunking 5 myths about cloud computing for small business (Sponsored)

Top 5 Text-to-Speech Open Source Models

Building Machine Learning Application with Django

5 Lightweight Alternatives to Pandas You Should Try

Data Cleaning at the Command Line for Beginner Data Scientists

7 LinkedIn Tricks to Get Noticed by Recruiters

Agentic AI Coding with Google Jules

Creating a Text to SQL App with OpenAI + FastAPI + SQLite

10 Essential Agentic AI Interview Questions for AI Engineers

7 ChatGPT Tricks to Automate Your Data Tasks

How Data Engineering Can Power Manufacturing Industry Transformation

5 NotebookLM Tips to Make Your Day a Little Easier

How To Get More Done Without Working More Hours (Sponsored)

Collecting Real-Time Data with APIs: A Hands-On Guide Using Python

The Benefits of an “Everything” Notebook in NotebookLM

How To Use Synthetic Data To Build a Portfolio Project

Top 5 Agentic Coding CLI Tools

Top 10 Free API Providers for Data Science Projects

The Complete Guide to Building Data Pipelines That Don’t Break

A Gentle Introduction to TypeScript for Python Programmers

The 10 AI Developments That Defined 2025

7 Steps to Mastering Agentic AI

10 Newsletters for Busy Data Scientists

A Gentle Introduction to MCP Servers and Clients

Pixi: A Smarter Way to Manage Python Environments

The 5 FREE Must-Read Books for Every LLM Engineer