Parallel Processing with Python

Modern computers have a CPU with multiple cores (typically 4 to 8). Come learn how to take advantage of them to parallelize and speed up your code. We’ll show you how to structure your code so you can parallelize it in five lines or fewer. We will also cover some theory and a few practical considerations, along with some basic exercises. We’ll be using the multiprocessing module in Python. The workshop is intended for users with basic Python knowledge and assumes you know how to i) write a for loop and ii) write a function that has inputs and outputs. Anaconda Python 3.5 will be used.
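To give a flavor of the "few lines" idea, here is a minimal sketch, not the workshop's own material. It assumes the work is already written as a function applied to each item of a list. The function count_primes, the input list limits and the pool size of 4 are all hypothetical choices made for illustration.

```python
from multiprocessing import Pool


def count_primes(limit):
    """Count the primes below `limit` by trial division (deliberately CPU-bound)."""
    count = 0
    for n in range(2, limit):
        if all(n % d for d in range(2, int(n ** 0.5) + 1)):
            count += 1
    return count


if __name__ == "__main__":
    # Hypothetical workload: one independent task per list entry.
    limits = [10000, 20000, 30000, 40000]

    # Serial version: an ordinary loop over the inputs.
    serial = [count_primes(limit) for limit in limits]

    # Parallel version: the same function mapped over a pool of worker processes.
    # The pool size of 4 is an assumption; adjust it to the number of cores available.
    with Pool(processes=4) as pool:
        parallel = pool.map(count_primes, limits)

    # Both approaches should produce the same list of counts.
    print(parallel == serial)
```

The `if __name__ == "__main__":` guard matters on platforms where worker processes re-import the main script. Without it, each worker would try to start a pool of its own.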

Regular Expressions II

Regular expressions are perfectly suited for people who like puzzles. A regular expression is a sequence of characters that defines a search pattern. They are commonly used for “find” and “find and replace” string operations, and to validate strings such as phone numbers and passwords in data entry. Regular expression capabilities can be found in some text editors and in a variety of programming languages and software, including ArcGIS, Java, JavaScript, MATLAB, Perl, PHP, Python, R, and Visual Basic. This workshop is part II of a two-part series and will cover more advanced topics such as captured groups, backreferences and assertions. The workshop will consist of hands-on example problems. A basic understanding of regular expressions is required: you should be able to understand expressions like “\w{3,}-\d{1,2}-\d{4}” and “des*ert?s?”. The tutorials will be conducted using Python. A basic programming background is helpful but not required for this workshop.
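As a rough preview of the three advanced topics, the sketch below uses Python's built-in re module. It is an illustration rather than workshop material. It assumes the first prerequisite pattern is meant to read \w{3,}-\d{1,2}-\d{4}, with backslashes before the w and d. The sample strings and variable names are made up for the example.

```python
import re

# Prerequisite-level patterns: quantifiers applied to character classes and literals.
basic = re.compile(r"\w{3,}-\d{1,2}-\d{4}")
assert basic.fullmatch("Feb-17-2016") is not None
assert re.fullmatch(r"des*ert?s?", "desserts") is not None

# Captured groups: parentheses save the pieces of a match for later use.
date = re.compile(r"(\w{3,})-(\d{1,2})-(\d{4})")
month, day, year = date.fullmatch("Feb-17-2016").groups()

# Backreference: \1 must repeat whatever group 1 matched (here, a doubled word).
doubled = re.compile(r"\b(\w+) \1\b")
repeated = doubled.search("parallelize the the loop").group(1)

# Lookahead assertion: match "Python" only when " 3" follows, without consuming it.
lookahead = re.compile(r"Python(?= 3)")
hits = lookahead.findall("Python 2 and Python 3")

print(month, day, year, repeated, hits)
```

Raw strings (the r"..." prefix) keep Python from interpreting the backslashes before the regular expression engine sees them.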

CSCAR Data Science Skills Series adds session on Pandas case studies — Feb. 17

CSCAR is offering a series of workshops on data science skills using Python. The workshops will be held in the Earl Lewis room in the Rackham building. All workshops will take place on Wednesday afternoons from 3:30 to 5 p.m.

No registration is necessary and there is no fee to attend. Please note: A new session has been added on Pandas case studies.

Schedule:

  • January 27: Data management with Pandas
  • February 10: Graphics and data visualization with Matplotlib and Bokeh
  • NEW: February 17: Pandas case studies (CMS data analyses)
  • February 24: Basic statistical analysis with Statsmodels
  • March 9: Sklearn for predictive analysis and data exploration
  • March 23: Advanced regression analysis (GEE, mixed models and multiple imputation) with Statsmodels
  • April 6: Survival analysis with Statsmodels

Additional workshops will be scheduled on the following topics, dates to be announced:

  • Geospatial analysis
  • Building and accessing databases
  • MPI, parallel, and distributed computing

Class material will be posted on the series website.

Workshops: Data Science Skills Series (Python) — Jan. 27 through April 6

CSCAR will offer a series of workshops on data science skills using Python. The workshops will be held in the Earl Lewis room in the Rackham building. All workshops will take place on Wednesday afternoons from 3:30 to 5 p.m.

The workshops are free and no registration is necessary.

Schedule:

  • January 27: Data management with Pandas
  • February 10: Graphics and data visualization with Matplotlib and Bokeh
  • February 24: Basic statistical analysis with Statsmodels
  • March 9: Sklearn for predictive analysis and data exploration
  • March 23: Advanced regression analysis (GEE, mixed models and multiple imputation) with Statsmodels
  • April 6: Survival analysis with Statsmodels

Additional workshops will be scheduled on the following topics, dates to be announced:

  • Geospatial analysis
  • Building and accessing databases
  • MPI, parallel, and distributed computing

Class material will be posted on the series website.

Introduction to Python — Sept. 14 – 21

Emanuel Gull, assistant professor of physics, is offering an Introduction to Python mini-course from Sept. 14-21. The target audience is incoming graduate students in physics, but students from other disciplines are welcome.

The schedule is as follows:

  • Monday, Sept. 14, 8:30 – 10 a.m.: Welcome & Getting Started (hello.py)
  • Tuesday, Sept. 15, 8:30 – 10 a.m.: Numbers, Strings, Lists, Dictionaries, Tuples, Functions, Modules, Control flow
  • Wednesday, Sept. 16, 8:30 – 10 a.m.: Useful Python libraries (part I): NumPy, SciPy, Matplotlib
  • Monday, Sept. 21, 8:30 – 10 a.m.: Useful Python libraries (part II): 3D plotting in Matplotlib and exercises

The course will be held in the Clarence Cook Little building, Room 1505. Visit this web page for more information and details.

CSCAR Python workshop on Regression — June 11-12

CSCAR is offering a Python workshop on regression. Registration is not required.

Python Regression Workshop
June 11-12
2-4 p.m. each day
4th Floor East Conference Room, Rackham Building

The workshop will focus on the use of Python and the Statsmodels library for regression analysis.

Participants should be familiar with basic Python or, at minimum, with another data-oriented programming language such as R. The CSCAR Python data management workshop would provide useful background in preparing your data for analysis.

Background materials are available on the workshop website.

Workshop on Data Management in Python — Feb. 10-12 at Rackham

CSCAR will offer a workshop on data management in Python on February 10, 11, and 12 from 4-6 p.m. each day, in the Rackham common room (Rackham Building lower level west).

The workshop will focus on using core Python, NumPy, and Pandas to manage and process data sets. Participants will learn how to read and clean data sets, generate reports, produce graphical summaries, and perform simple statistical analyses.

This workshop will have a lecture/discussion format and is not held in a computer lab. Participants may bring their own laptops if they wish but this is not required. All software discussed in the workshop is free and open source, and runs on all major platforms. Follow-up consulting for UM researchers using Python in their research is available from CSCAR.

There is no registration for this workshop, and there is no charge to attend. Participants should plan to attend all three sessions.

CSCAR offering workshop on regression modeling in Python — Jan. 7 and 8

CSCAR will offer a two-part workshop on regression modeling in Python. The main focus will be on linear models and generalized linear models (logistic, Poisson, and negative binomial regression). The statistical background will be reviewed briefly, but the main emphasis will be on fitting regression models using Python and its scientific libraries. All software discussed in the workshop is free and open source, and runs on all major platforms. There is no charge to attend the workshop.

For more information, contact cscar@umich.edu.

Training materials are available at: https://github.com/kshedden/Python-regression-workshop/wiki

Date: Jan. 7 and 8 (participants should plan to attend both days)

Time: 4 – 6 p.m.

Location: Rackham Commons Room, Lower Level.