
MS6711 assignment writing, Python programming services

Date: 2025-03-07  Source: hfw.cc  Author: hfw.cc



MS6711 Data Mining
Homework 2
Instructions
This homework contains both coding and non-coding questions. Please submit two files:
1. One Word or PDF document with the answers and plots for ALL questions, without coding details.
2. One Jupyter notebook containing your code.
Questions 1 and 2 are about concepts; Questions 3 to 6 are about coding.
Problem 1 [20 points]
We perform best subset, forward stepwise, and backward stepwise selection on the same dataset with p
predictors. For each approach, we obtain p + 1 models containing 0, 1, 2, ..., p predictors. Explain your
answers.
1. Which of the three models with k predictors has the smallest training RSS?
2. Which of the three models with k predictors has the smallest test RSS? (Best
subset, forward, backward, or cannot be determined?)
3. True or False: The predictors in the k-variable model identified by forward stepwise are a subset of
the predictors in the (k + 1)-variable model identified by forward stepwise selection.
4. True or False: The predictors in the k-variable model identified by best subset are a subset of the
predictors in the (k + 1)-variable model identified by best subset selection.
5. True or False: The lasso, relative to OLS, is less flexible and hence will give improved prediction
accuracy when its increase in bias is less than its decrease in variance.
Problem 2 [20 points]
Suppose we estimate Lasso by minimizing
$$\|Y - X\beta\|_2^2 + \lambda \|\beta\|_1$$
for a particular value of λ. For parts 1 to 4, indicate which of (a) to (e) is correct and explain your answer.
1. As we increase λ from 0, the training RSS will
(a) Increase initially, and then eventually start decreasing in an inverted U shape.
(b) Decrease initially, and then eventually start increasing in a U shape.
(c) Steadily increase.
(d) Steadily decrease.
(e) Remain constant.
2. Repeat 1. for test RSS.
3. Repeat 1. for variance.
4. Repeat 1. for (squared) bias.
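To make the penalized criterion above concrete, here is a minimal sketch that evaluates the lasso objective for a given coefficient vector. The function name `lasso_objective` and the toy data are illustrative assumptions, not part of the assignment, and the sketch only evaluates the objective; it does not fit a lasso model.

```python
def lasso_objective(X, y, beta, lam):
    """Return ||y - X beta||_2^2 + lam * ||beta||_1 for plain Python lists."""
    rss = 0.0
    for row, target in zip(X, y):
        # Fitted value for one observation: inner product of the row with beta.
        fitted = sum(x_ij * b_j for x_ij, b_j in zip(row, beta))
        rss += (target - fitted) ** 2
    l1_penalty = sum(abs(b_j) for b_j in beta)
    return rss + lam * l1_penalty


# Toy data (hypothetical): beta = [1, 2] reproduces y exactly, so the RSS is 0.
X = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
y = [1.0, 2.0, 3.0]
print(lasso_objective(X, y, [1.0, 2.0], lam=0.0))  # 0.0: perfect fit, no penalty
print(lasso_objective(X, y, [1.0, 2.0], lam=0.5))  # 1.5: zero RSS plus 0.5 * (1 + 2)
```

With λ = 0 the objective reduces to the ordinary RSS, which is why the OLS fit minimizes it in that case.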
Problem 3 [20 points]
These data record the level of atmospheric ozone concentration from eight daily meteorological
measurements made in the Los Angeles basin in 1976. We have the 330 complete cases. We want to find
climate/weather factors that impact ozone readings. Ozone is a hazardous byproduct of burning fossil
fuels and can harm lung function. The data set for this problem is:
Variable name  Definition
ozone          Long Maximum Ozone
vh             Vandenberg 500 mb Height
wind           Wind speed (mph)
humidity       Humidity (%)
temp           Sandburg AFB Temperature
ibh            Inversion Base Height
dpg            Daggot Pressure Gradient
ibt            Inversion Base Temperature
vis            Visibility (miles)
doy            Day of the Year
[Note: I recommend using R for this question, since Python does not have a package for
forward / backward selection. See the code example on Canvas. Alternatively, you may use the sample
Python code I provided.]
1. Report result of linear regression using all variables. Note that ozone is the response variable to
predict. What variables are significant?
2. Report the selected variables using the following model selection approaches.
(a) All subset selection.
(b) Forward stepwise
(c) Backward stepwise
3. Compare the outcome of these methods with the significant variables found in the full linear regression in question 1.
4. Other transformations of the covariates might also be important. What happens if you do all
subset selection using both the original variables and their squares? That is, for every variable X,
include both X and X^2 in the linear regression model used for all subset selection.
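Forward stepwise selection is simple enough to sketch by hand in Python. The following is a minimal illustration on synthetic data, assuming NumPy is available; the helper names `rss` and `forward_stepwise` are hypothetical and this is not the sample code referred to in the note. It returns the nested sequence of models only. Choosing among them still requires a criterion such as cross-validation, which is not shown.

```python
import numpy as np


def rss(X, y, cols):
    """Training RSS of an OLS fit with an intercept plus the given columns."""
    design = np.column_stack([np.ones(len(y))] + [X[:, j] for j in cols])
    coef, *_ = np.linalg.lstsq(design, y, rcond=None)
    resid = y - design @ coef
    return float(resid @ resid)


def forward_stepwise(X, y):
    """Greedy path of models M_0, ..., M_p, each given as a list of column indices."""
    remaining = list(range(X.shape[1]))
    selected = []
    path = [[]]  # M_0 is the intercept-only model
    while remaining:
        # Add the single predictor that lowers the training RSS the most.
        best = min(remaining, key=lambda j: rss(X, y, selected + [j]))
        selected.append(best)
        remaining.remove(best)
        path.append(list(selected))
    return path


# Synthetic stand-in data (not the ozone data): only columns 2 and 0 carry signal.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 5))
y = 3.0 * X[:, 2] - 2.0 * X[:, 0] + rng.normal(scale=0.1, size=200)

path = forward_stepwise(X, y)
for cols in path:
    print(len(cols), cols, round(rss(X, y, cols), 2))
```

Backward stepwise follows the same pattern in reverse: start from all predictors and repeatedly drop the one whose removal increases the RSS the least.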
Problem 4 [20 points]
In this exercise, we will predict the number of applications received using the other variables in the College
data set.
Private      Public/private school indicator
Apps         Number of applications received
Accept       Number of applicants accepted
Enroll       Number of new students enrolled
Top10perc    New students from top 10% of high school class
Top25perc    New students from top 25% of high school class
F.Undergrad  Number of full-time undergraduates
P.Undergrad  Number of part-time undergraduates
Outstate     Out-of-state tuition
Room.Board   Room and board costs
Books        Estimated book costs
Personal     Estimated personal spending
PhD          Percent of faculty with Ph.D.
Terminal     Percent of faculty with terminal degree
S.F.Ratio    Student/faculty ratio
perc.alumni  Percent of alumni who donate
Expend       Instructional expenditure per student
Grad.Rate    Graduation rate
1. Split the data set into a training set and a test set.
2. Fit a linear regression model using OLS on the training set, and report the test error obtained.
3. Fit a ridge regression model on the training set, with λ chosen by cross-validation. Report the test
error obtained.
4. Fit a lasso model on the training set, with λ chosen by cross-validation. Report the test error
obtained, along with the number of non-zero coefficient estimates.
5. Fit a PCR model on the training set, with number of components chosen by cross-validation. Report
the test error obtained, along with the value of M selected by cross-validation.
6. Fit a PLS model on the training set, with number of components chosen by cross-validation. Report
the test error obtained, along with the number of components selected by cross-validation.
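The "fit on the training set, choose λ by cross-validation, report test error" workflow in the steps above can be sketched for ridge regression as follows. This is a minimal NumPy illustration on synthetic data, not the College data; the function names, the λ grid, the fold count, and the split sizes are all assumptions made for the example. In practice the predictors would usually be standardized first, and a library routine would normally replace the hand-written loop.

```python
import numpy as np


def ridge_fit(X, y, lam):
    """Closed-form ridge on centered data; the intercept is not penalized."""
    x_mean, y_mean = X.mean(axis=0), y.mean()
    Xc, yc = X - x_mean, y - y_mean
    p = X.shape[1]
    beta = np.linalg.solve(Xc.T @ Xc + lam * np.eye(p), Xc.T @ yc)
    intercept = y_mean - x_mean @ beta
    return intercept, beta


def cv_mse(X, y, lam, k=5, seed=0):
    """Average validation MSE over k folds for one value of lam."""
    idx = np.random.default_rng(seed).permutation(len(y))
    errors = []
    for fold in np.array_split(idx, k):
        train = np.setdiff1d(idx, fold)  # every index not in the held-out fold
        b0, b = ridge_fit(X[train], y[train], lam)
        pred = b0 + X[fold] @ b
        errors.append(np.mean((y[fold] - pred) ** 2))
    return float(np.mean(errors))


def choose_lambda(X, y, grid, k=5):
    """Return the grid value with the smallest cross-validated MSE."""
    return min(grid, key=lambda lam: cv_mse(X, y, lam, k=k))


# Synthetic stand-in data with a hypothetical 100/50 train/test split.
rng = np.random.default_rng(1)
X = rng.normal(size=(150, 6))
y = X @ np.array([2.0, -1.0, 0.0, 0.0, 1.5, 0.0]) + rng.normal(size=150)
X_train, y_train, X_test, y_test = X[:100], y[:100], X[100:], y[100:]

grid = [0.01, 0.1, 1.0, 10.0, 100.0]
best_lam = choose_lambda(X_train, y_train, grid)
b0, b = ridge_fit(X_train, y_train, best_lam)
test_mse = float(np.mean((y_test - (b0 + X_test @ b)) ** 2))
print("chosen lambda:", best_lam, "test MSE:", test_mse)
```

The test set is touched only once, after λ has been chosen on the training folds, so the reported test error is not used for tuning.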
Problem 5 [20 points]
We will now try to predict per capita crime rate in the Boston data set.
crim     per capita crime rate by town.
zn       proportion of residential land zoned for lots over 25,000 sq.ft.
indus    proportion of non-retail business acres per town.
chas     Charles River dummy variable (= 1 if tract bounds river; 0 otherwise).
nox      nitrogen oxides concentration (parts per 10 million).
rm       average number of rooms per dwelling.
age      proportion of owner-occupied units built prior to 1940.
dis      weighted mean of distances to five Boston employment centres.
rad      index of accessibility to radial highways.
tax      full-value property-tax rate per $10,000.
ptratio  pupil-teacher ratio by town.
black    1000(Bk − 0.63)^2 where Bk is the proportion of blacks by town.
lstat    lower status of the population (percent).
medv     median value of owner-occupied homes in $1000s.
1. Try out some of the regression methods explored in this chapter, such as best subset selection, the
lasso, ridge regression, PCR and partial least squares. Present and discuss results for the approaches
that you consider.
2. Propose a model (or set of models) that seems to perform well on this data set, and justify your
answer. Make sure that you are evaluating model performance using validation set error, cross-validation, or some other reasonable alternative, as opposed to using training error.
3. Does your chosen model involve all of the features in the data set? Why or why not?
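Of the methods listed above, principal components regression (PCR) is the one least often available as a single library call, so a minimal NumPy sketch may help. It standardizes the predictors, projects them onto the first M principal directions, and regresses the centered response on the resulting scores. The function names and the synthetic data are assumptions for illustration; selecting M by cross-validation is left out.

```python
import numpy as np


def pcr_fit(X, y, n_components):
    """Fit PCR: standardize X, keep the leading principal directions, run OLS on scores."""
    x_mean, x_std = X.mean(axis=0), X.std(axis=0)
    Z = (X - x_mean) / x_std
    # Rows of Vt are the principal directions of the standardized predictors.
    _, _, Vt = np.linalg.svd(Z, full_matrices=False)
    V = Vt[:n_components].T
    scores = Z @ V
    y_mean = y.mean()
    # The scores are centered, so no intercept column is needed here.
    gamma, *_ = np.linalg.lstsq(scores, y - y_mean, rcond=None)
    return {"x_mean": x_mean, "x_std": x_std, "V": V, "gamma": gamma, "y_mean": y_mean}


def pcr_predict(model, X):
    """Predict with a fitted PCR model, reusing the training mean and scale."""
    Z = (X - model["x_mean"]) / model["x_std"]
    return model["y_mean"] + Z @ model["V"] @ model["gamma"]


# Synthetic stand-in data (not the Boston data) with two correlated predictors.
rng = np.random.default_rng(2)
X = rng.normal(size=(80, 4))
X[:, 3] = X[:, 0] + 0.1 * rng.normal(size=80)
y = X[:, 0] - X[:, 1] + rng.normal(scale=0.5, size=80)

for m in range(1, 5):
    fitted = pcr_predict(pcr_fit(X, y, m), X)
    print(m, "components, training MSE:", float(np.mean((y - fitted) ** 2)))
```

When M equals the number of predictors, PCR reproduces the OLS fit. Training error can only fall as M grows, which is why M should be chosen on held-out data rather than on training error.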
Problem 6 [20 points]
In a bike sharing system the process of obtaining membership, rental, and bike return is automated
via a network of kiosk locations throughout a city. In this problem, you will try to combine historical
usage patterns with weather data to forecast bike rental demand in the Capital Bikeshare program in
Washington, D.C.
You are provided hourly rental data collected from the Capital Bikeshare system spanning two years.
The file Bike train.csv, as the training set, contains data for the first 19 days of each month, while
Bike test.csv, as the test set, contains data from the 20th to the end of the month. The dataset includes
the following information:
daylabel                day number ranging from 1 to 731
year, month, day, hour  hourly date
season                  1 = spring, 2 = summer, 3 = fall, 4 = winter
holiday                 whether the day is considered a holiday
workingday              whether the day is neither a weekend nor a holiday
weather                 1 = clear, few clouds, partly cloudy
                        2 = mist + cloudy, mist + broken clouds, mist + few clouds, mist
                        3 = light snow, light rain + thunderstorm + scattered clouds, light rain
                        4 = heavy rain + ice pellets + thunderstorm + mist, snow + fog
temp                    temperature in Celsius
atemp                   "feels like" temperature in Celsius
humidity                relative humidity
wind speed              wind speed
count                   number of total rentals, outcome variable to predict
Predictions will be evaluated using the root mean squared error (RMSE), calculated as
$$\mathrm{RMSE} = \sqrt{\frac{1}{n} \sum_{i=1}^{n} (y_i - \hat{y}_i)^2}$$
where $y_i$ is the true count, $\hat{y}_i$ is the prediction, and n is the number of entries to be evaluated.
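The RMSE definition above translates directly into code. The following is a small sketch using only the Python standard library; the function name `rmse` and the example numbers are illustrative assumptions.

```python
import math


def rmse(y_true, y_pred):
    """Root mean squared error between true values and predictions."""
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    squared_errors = [(y - y_hat) ** 2 for y, y_hat in zip(y_true, y_pred)]
    return math.sqrt(sum(squared_errors) / len(squared_errors))


# Errors are 1, 0 and -2, so RMSE = sqrt((1 + 0 + 4) / 3), about 1.291.
print(rmse([3, 5, 2], [2, 5, 4]))
```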
Build a model on the training dataset to predict the bikeshare counts for the hours recorded in the test
dataset. Report your prediction RMSE on the test set.
Some tips
• This is a relatively open question; you may use any model you learned in this class.
• It will be helpful to examine the data graphically to spot any seasonal pattern or temporal trend.
• There is one day in the training data with anomalous atemp records and another day with abnormal
humidity. Find those rows and think about what you want to do with them. Is there anything
unusual in the test data?
• It might be helpful to transform the count to log(count + 1). If you do, do not forget to
transform your predicted values back to the count scale.
• Think about how you would include each predictor into the model, as continuous or as categorical?
• Is there any transformation of the predictors or interactions between them that you think might be
helpful?
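The log(count + 1) tip above pairs with an inverse transform at prediction time. Here is a minimal standard-library sketch; the function names are hypothetical, and clipping negative back-transformed values at zero is an added assumption (counts cannot be negative), not something the assignment prescribes.

```python
import math


def to_log_scale(counts):
    """Map counts to log(count + 1); log1p stays accurate for small counts."""
    return [math.log1p(c) for c in counts]


def to_count_scale(log_predictions):
    """Invert log(count + 1) with exp(v) - 1, clipping negative results at 0 (assumption)."""
    return [max(0.0, math.expm1(v)) for v in log_predictions]


counts = [0, 1, 16, 250]
log_counts = to_log_scale(counts)
print(log_counts)
print(to_count_scale(log_counts))  # recovers the original counts up to rounding
```

The RMSE should be computed after the back-transform, on the original count scale, since that is the scale the evaluation formula uses.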
Try to summarize your exploration of the data and your modeling process. You may fit a few models and
choose one of them. You will receive points based on your write-up and test RMSE. This is not a
competition among the class to achieve the minimal RMSE, but your result should be in a reasonable
range.

