BEGIN:VCALENDAR
VERSION:2.0
PRODID:-//Office of Advanced Research Computing - ECPv6.0.10//NONSGML v1.0//EN
CALSCALE:GREGORIAN
METHOD:PUBLISH
X-WR-CALNAME:Office of Advanced Research Computing
X-ORIGINAL-URL:https://oarc.rutgers.edu
X-WR-CALDESC:Events for Office of Advanced Research Computing
REFRESH-INTERVAL;VALUE=DURATION:PT1H
X-Robots-Tag:noindex
X-PUBLISHED-TTL:PT1H
BEGIN:VTIMEZONE
TZID:America/New_York
BEGIN:DAYLIGHT
TZOFFSETFROM:-0500
TZOFFSETTO:-0400
TZNAME:EDT
DTSTART:20210314T070000
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0400
TZOFFSETTO:-0500
TZNAME:EST
DTSTART:20211107T060000
END:STANDARD
END:VTIMEZONE
BEGIN:VEVENT
DTSTART;TZID=America/New_York:20210928T130000
DTEND;TZID=America/New_York:20210928T160000
DTSTAMP:20260405T001822
CREATED:20210813T203720Z
LAST-MODIFIED:20210921T150237Z
UID:5451-1632834000-1632844800@oarc.rutgers.edu
SUMMARY:Machine Learning Series by OARC—Random Forest Workshop
DESCRIPTION:Please register to attend this workshop at the bottom of this page. After filling out the registration form\, we will email you the Zoom link. \nAmarel account: You need an Amarel account to participate in the lab section Apply here as soon as possible. \nVPN setup: You have to be connected on Rutgers’ network or be on VPN to access Amarel resources. \nIf you have questions or need help\, please contact Janet Chang. \nTopics:\n1. Introduction to Machine Learning (ML) \n\nBig data and Machine Learning\nRelate Machine Learning to other disciplines\nMachine Learning algorithms\nClassification and Regression\n\n2. Understanding Random Forest (RF) \n\nApplications of Random Forests\nWhy Random Forests\nThe Random Forest Algorithm\nFundamental concepts – ML\, RF\n\n3. Implementing Random Forest \n\nFeature Importance and Feature Selection\nDealing with missing data\, and imbalanced data\nBest split of the node–node impurity\n\n\nOver-fitting and underfitting\nThe model performance\nThe model interpretability\n\n4. Lab Exercise \nWe will use cancer health data combined with gene expression data to build random forest models\, predicting output variables. Both classifier and regressor will be addressed. \nLab 1\, Set up\, and launch R \nLab 2\, Data preparation \n\n2a. pre-processing\,\n2b. data partition\,\n2c. missing data imputation\,\n2d. feature selection\n\nLab 3\, Building the RF model: \n\n3b. handling imbalanced data\n3c. building the RF model\n3d. turning the parameters\n\nLab 4\, Validation and model performance \n\n4a. prediction and Confusion Matrix — test data\n4b. ROC curve and AUC\n4c. k-fold cross-validation\n4d. parallel computing\n\nLab 5\, Visualization and the model interpretation \n\n5a. plotting the random forest tree\n5b. plotting feature importance\n5c. partial dependence plot (PDP)\n5d. MDS – multi-dimensional scaling plot of proximity matrix\n\n \nTo Register: \n\n	Notice: JavaScript is required for this content.
URL:https://oarc.rutgers.edu/event/machine-learning-series-by-oarc-random-forest-workshop-9-28-21/
END:VEVENT
END:VCALENDAR