BEGIN:VCALENDAR
VERSION:2.0
PRODID:-//Office of Advanced Research Computing - ECPv6.0.10//NONSGML v1.0//EN
CALSCALE:GREGORIAN
METHOD:PUBLISH
X-WR-CALNAME:Office of Advanced Research Computing
X-ORIGINAL-URL:https://oarc.rutgers.edu
X-WR-CALDESC:Events for Office of Advanced Research Computing
REFRESH-INTERVAL;VALUE=DURATION:PT1H
X-Robots-Tag:noindex
X-PUBLISHED-TTL:PT1H
BEGIN:VTIMEZONE
TZID:America/New_York
BEGIN:DAYLIGHT
TZOFFSETFROM:-0500
TZOFFSETTO:-0400
TZNAME:EDT
DTSTART:20220313T070000
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0400
TZOFFSETTO:-0500
TZNAME:EST
DTSTART:20221106T060000
END:STANDARD
END:VTIMEZONE
BEGIN:VEVENT
DTSTART;TZID=America/New_York:20221111T140000
DTEND;TZID=America/New_York:20221111T160000
DTSTAMP:20260430T050701
CREATED:20220809T195251Z
LAST-MODIFIED:20221128T162518Z
UID:7440-1668175200-1668182400@oarc.rutgers.edu
SUMMARY:Python for Big Data - Part 1
DESCRIPTION:Workshop content: In recent years\, Python has become one of the top programming languages for doing data analysis due to its inherent advantages such as simplicity\, readability\, portability\, etc.\, However\, Python is slow compared to C or Fortran\, and it does not manage memory well. These limitations\, with speed and memory management\, may not be significant when analyzing small datasets\, but they become bottlenecks when analyzing big datasets. \nTo address the challenges associated with big data analytics\, the Python community developed and tested several techniques. In this workshop\, we will go through some of these techniques including vectorization\, parallelization\, just in time compilation\, and distributed task executions. We will do hands-on exercises to emphasize the following solutions. \nObjectives \n\nHow to speed up the data analysis?\nWhat to do when the data set size exceeds the available physical memory?\nHow to distribute the workloads when doing machine learning for big data sets?\n\nIf you have questions or need help\, please email Bala Desinghu. \nAmarel account: Apply here as soon as possible. You must have an Amarel account set up before the workshop. \nVPN setup: You have to be connected on Rutgers’ network or be on VPN to access Amarel resources. \nSSH setup: Windows users must install an SSH client like PuTTY or MobaXterm. Alternatively\, Windows 10 users can install the complete Windows Subsystem for Linux.
URL:https://oarc.rutgers.edu/event/python-for-big-data-11-11-22/
END:VEVENT
END:VCALENDAR