Question:
Why is my implementation of linear regression not working?

Problem:

I am trying to implement linear regression from scratch in Python.


For reference, here are the mathematical formulae I have used:

Hypothesis: h(θ) = Xθ
Cost: J(θ) = (1 / 2m) (Xθ − y)ᵀ(Xθ − y)
Gradient-descent update: θ := θ − (α / m) Xᵀ(Xθ − y)


This is what I tried:

import numpy as np


class LinearRegression:

    def __init__(
        self,
        features: np.ndarray[np.float64],
        targets: np.ndarray[np.float64],
    ) -> None:
        # Prepend a column of ones so params[0] acts as the intercept term.
        self.features = np.concatenate((np.ones((features.shape[0], 1)), features), axis=1)
        self.targets = targets
        self.params = np.random.randn(features.shape[1] + 1)
        self.num_samples = features.shape[0]
        self.num_feats = features.shape[1]
        self.costs = []

    def hypothesis(self) -> np.ndarray[np.float64]:
        # Predictions: X @ theta
        return np.dot(self.features, self.params)

    def cost_function(self) -> np.float64:
        # J(theta) = (1 / 2m) * (X theta - y)^T (X theta - y)
        pred_vals = self.hypothesis()
        return (1 / (2 * self.num_samples)) * np.dot((pred_vals - self.targets).T, pred_vals - self.targets)

    def update(self, alpha: np.float64) -> None:
        # One gradient-descent step: theta := theta - (alpha / m) * X^T (X theta - y)
        self.params = self.params - (alpha / self.num_samples) * (self.features.T @ (self.hypothesis() - self.targets))

    def gradientDescent(self, alpha: np.float64, threshold: np.float64, max_iter: int) -> None:
        converged = False
        counter = 0
        while not converged:
            counter += 1
            curr_cost = self.cost_function()
            self.costs.append(curr_cost)
            self.update(alpha)
            new_cost = self.cost_function()
            # Stop when the cost change drops below the threshold...
            if abs(new_cost - curr_cost) < threshold:
                converged = True
            # ...or the iteration cap is reached.
            if counter > max_iter:
                converged = True

I used the class like this:

regr = LinearRegression(features=np.linspace(0, 1000, 200, dtype=np.float64).reshape((20, 10)), targets=np.linspace(0, 200, 20, dtype=np.float64))

regr.gradientDescent(0.1, 1e-3, 1e+3)

regr.cost_function()


However, I am getting the following errors:

RuntimeWarning: overflow encountered in scalar power

  return (1 / (2 * self.num_samples)) * (la.norm(self.hypothesis() - self.targets) ** 4)


RuntimeWarning: invalid value encountered in scalar subtract

  if abs(new_cost - curr_cost) < threshold:


RuntimeWarning: overflow encountered in matmul

  self.params = self.params - (alpha / self.num_samples) * (self.features.T @ (self.hypothesis() - self.targets))


Can anyone help me understand what is going wrong exactly?


Solution:

It is overflowing because the numbers in your example are too large for the learning rate you chose: with feature values up to 1000, the gradient Xᵀ(Xθ − y) is huge, so every update with α = 0.1 overshoots, the parameters grow without bound, and the cost eventually overflows to inf/NaN (which is why the subtraction in the convergence check also fails). Scaling the inputs down (or, equivalently, using a much smaller learning rate) keeps the updates stable. Try using:


regr = LinearRegression(features=np.linspace(0, 1000, 200, dtype=np.float64).reshape((20, 10))/1000, targets=np.linspace(0, 200, 20, dtype=np.float64)/1000)

regr.gradientDescent(0.1, 1e-3, 1e+3)

regr.cost_function()


It gives me an output of 0.00474225348416323.
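
An alternative to dividing by a hand-picked constant is to standardize the feature columns before fitting. The sketch below is a minimal example assuming the LinearRegression class from the question is already defined; the standardize helper is my own addition, not part of the original code, and the np.linalg.lstsq call is only a sanity check against the exact least-squares solution.

import numpy as np

# Hypothetical helper, not part of the original post: rescale each feature
# column to zero mean and unit variance so gradient descent stays stable
# without choosing a magic divisor.
def standardize(x: np.ndarray) -> np.ndarray:
    return (x - x.mean(axis=0)) / x.std(axis=0)

features = np.linspace(0, 1000, 200, dtype=np.float64).reshape((20, 10))
targets = np.linspace(0, 200, 20, dtype=np.float64)

regr = LinearRegression(features=standardize(features), targets=targets)
regr.gradientDescent(0.1, 1e-3, 1000)
print(regr.cost_function())

# Sanity check: the converged cost should be close to the exact
# least-squares optimum for the same standardized design matrix.
X = np.concatenate((np.ones((features.shape[0], 1)), standardize(features)), axis=1)
theta, *_ = np.linalg.lstsq(X, targets, rcond=None)
print((1 / (2 * len(targets))) * np.sum((X @ theta - targets) ** 2))

Standardization also means a single learning rate behaves similarly across datasets whose raw feature scales differ widely.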



