Data Programming Assignment ,Help With DataFrame X Assignment,Python Programming AssignmentDebug ,Help With Python Assignment Statistic

Week 10 - Assessed Exercises
Data Programming with Python
In this set of exercises we will fit some regression models and create a stepwise AIC function. As
we learnt in lectures to fit a regression model, we need to create a DataFrame X and Series y.
X should contain the standardised version of all of the explanatory/ exogenous variables and y
should contain the standardised version of the response/ endogenous variable. To fit the intercept,
X must have an additional column of ones.
Each question asks you to write a function with a specific set of input arguments. The .py template
defines the function name and inputs for each question, do not change these. Be sure you test
your functions before you submit your code to make sure that they are outputting the correct
answer. Unless otherwise stated, all functions must have a return value. This week you should test
your code using both the prostate and diamonds datasets. Testing your functions with multiple
datasets should catch any error related to leaving the DataFrame names inside your function.
Include the import statements for all packages used within your code. Additionally, please include
the package prefixes (pd, np, etc.) for functions/methods from these packages, even if the command
runs in Canopy without the prefix. .
1. Write a function to create X and y for a given DataFrame df The function inputs are the
DataFrame df and the label of the response/endogenous variable rescol. The function should
return two objects, X and y (in that order), where X and why are both standardised and
the column of ones is the first column of X. (You may assume that none of the variables are
categorical)
2. Write a function that takes X and y as inputs and fits a linear regression model. The function
should return the rsquared value rounded to 4 decimal places
AIC is the Akaike information criterion. It’s designed to penalise models with lots of explanatory
variables so that we pick models which fit the data well but aren’t too complicated. In general, if
you have two models fitted to the same data, the model with the lowest AIC is preferable. The
AIC is given as part of the model summary with OLS .
The steps to run a forward selection AIC regression are:
(a) Run a linear regression with just the intercept column. Get the AIC
(b) Add in the explanatory variables individually, run a linear regression for each one and determine
how much they decreases the AIC
(c) Find the variable with the biggest decrease in AIC and include it in your linear model
(d) Repeat step (b)-(c) with this new linear model and remaining explanatory variables
(e) Repeat this process until none of the remaining explanatory variables reduce the AIC
The explanatory variables that have been included up to the stopping point are considered the
variables that produce a good fit without overcomplicating the model.
3. Write a function that performs the AIC algorithm for a given DataFrame X and Series y.
The function should return the names of the columns used for the model that gives the lowest
AIC. This question is worth 2 marks
All of your code should be written into the .py template. Save your filled .py file with the following
name structure SurnameFirstname Week10.py (where Surname and Firstname should be replaced
with your name) and upload it to Brightspace. You must also upload a PDF of your code.

QQ：99515681
WeChat：codinghelp
Email：99515681@qq.com
Work Time：8:00-23:00

Hots

Ghostwriter Cs1b Spring 2024 Tth Hw08h... 2024-04-19
Help With Managing Financial Risk Prob... 2024-04-19
Ghostwriter Cs 0449 – Project 5: /Dev/ 2024-04-19
Ghostwriter Elec 2141 Digital Circuit ... 2024-04-19
Help With Csc171 — Videogame Projecthe 2024-04-19
Help With Comp3411 Artificial Intellig 2024-04-19
Help With Stat3061: Random Processes &... 2024-04-19
Ghostwriter Accounting 452, Spring 202... 2024-04-19
Ghostwriter Finc5001 Foundations In Fi... 2024-04-19
Ghostwriter 7Ssmm712 – Topics In Appli 2024-04-19
Help With Com 337 - Film Studies For T... 2024-04-19
Ghostwriter Mes202tc - Digital Vlsi Sy... 2024-04-19
Ghostwriter Geography 2041B Distance S... 2024-04-19
Ghostwriter Ecos3006 International Tra... 2024-04-19
Help With Fit5225 2024 Sm1 Creating An... 2024-04-19
Help With Cit 593: Introduction To Com... 2024-04-19
Help With Math 4931: Take Home Examgho... 2024-04-19
Ghostwriter Csci 547|Info 533: Systems... 2024-04-19
Ghostwriter Cs536-S24 Intro To Pls And... 2024-04-19
Help With Fit5212 - Assignment 1Ghostw... 2024-04-19

Programming Assignment Help！