K-Nearest Neighbors — A Complete Guide

K-Nearest Neighbors is a simple and easy-to-use supervised machine learning algorithm. It can be used for both classification and regression problems; the difference lies in the dependent variable. In classification KNN the dependent variable is categorical, whereas in regression KNN it is continuous. We will look at both of these in detail with an example.
KNN is a lazy algorithm: it does not perform any training when the data is passed in, it simply stores the data, and the real work is done when a query arrives. KNN works by measuring the distance between the query and every point in the training data, selecting the specified number of closest points, and then predicting either the most frequent label among them (in the case of classification) or the average of their labels (in the case of regression).
It is also a non-parametric algorithm, that is, it makes no assumptions about the underlying data distribution.
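The lazy-learning procedure described above can be sketched in a few lines of plain Python. This is a minimal illustration of the idea, not the library implementation; `knn_predict` and its parameter names are made up for this sketch:

```python
from collections import Counter
import math

def knn_predict(train_X, train_y, query, k, task="classification"):
    # Distance from the query to every stored training point (Euclidean)
    distances = [math.dist(x, query) for x in train_X]
    # Indices of the k closest training points
    nearest = sorted(range(len(train_X)), key=lambda i: distances[i])[:k]
    labels = [train_y[i] for i in nearest]
    if task == "classification":
        # Majority vote among the k nearest labels
        return Counter(labels).most_common(1)[0][0]
    # Regression: average of the k nearest labels
    return sum(labels) / k

# Two well-separated clusters as toy training data
train_X = [(0, 0), (1, 0), (0, 1), (5, 5), (6, 5), (5, 6)]
train_y = ["benign", "benign", "benign", "malignant", "malignant", "malignant"]

print(knn_predict(train_X, train_y, (0.5, 0.5), k=3))  # → benign
```

Note that "training" here amounts to holding references to `train_X` and `train_y`; all computation happens at query time, which is exactly what makes KNN lazy.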
KNN Classifier:
We will use the breast cancer Wisconsin dataset. This is a classification problem where the aim is to classify instances as either malignant or benign based on the following 10 features:
- radius (mean of distances from center to points on the perimeter)
- texture (standard deviation of gray-scale values)