{ "cells": [ { "cell_type": "code", "execution_count": 1, "metadata": {}, "outputs": [ { "data": { "text/html": [ "\n" ], "text/plain": [ "" ] }, "execution_count": 1, "metadata": {}, "output_type": "execute_result" } ], "source": [ "from IPython.display import HTML\n", "import requests\n", "HTML(requests.get(\"https://git.io/fh5WI\").text)" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "
\n", "

Python and Jupyter for Programmers

\n", "

Part 2: Web Scrapping with Requests and BeautifulSoup

\n", "
\n", "\n", "\n", "\n", "
\n", "

Contents

\n", "
\n", "
\n", "
    \n", "
  1. Project
  2. \n", "
  3. Requests
  4. \n", "
  5. Beautiful Soup
  6. \n", "
  7. String Processing
  8. \n", "
      \n", "
\n", "
" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "
\n", "go top\n", "\n", "

\n", "1. Project\n", "

\n", "\n", "\n", "Here is a page with a directory of graduate students at the UB Math Department: \n", "
\n",
    "http://www.buffalo.edu/cas/math/people/grad-directory.html\n",
    "
\n", "\n", "\n", "\n", "Objective: Retrieve names and email addresses of all students. \n", "
" ] }, { "cell_type": "code", "execution_count": 1, "metadata": { "scrolled": false }, "outputs": [ { "data": { "text/html": [ "\n", " \n", " " ], "text/plain": [ "" ] }, "execution_count": 1, "metadata": {}, "output_type": "execute_result" } ], "source": [ "import IPython\n", "url = \"http://www.buffalo.edu/cas/math/people/grad-directory.html\"\n", "IPython.display.IFrame(url, width = 1000, height = 600)" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "
\n", "go top\n", "\n", "

\n", "2. Requests\n", "

\n", "\n", "\n", "
\n", "\n", "
    \n", "
  • Requests is a Python library for interacting with web pages.
  • \n", "
  • A short introduction to requests is available here.
  • \n", "
\n", "\n", "
" ] }, { "cell_type": "code", "execution_count": 1, "metadata": {}, "outputs": [], "source": [ "import requests" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "" ] }, { "cell_type": "code", "execution_count": 2, "metadata": {}, "outputs": [], "source": [ "grad_page = requests.get(\"http://www.buffalo.edu/cas/math/people/grad-directory.html\")" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Header information:" ] }, { "cell_type": "code", "execution_count": 3, "metadata": {}, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "{'Date': 'Mon, 04 Mar 2019 22:38:04 GMT', 'Server': 'Apache/2.4.6 () Communique/4.2.3 OpenSSL/1.0.2k-fips', 'Last-Modified': 'Mon, 04 Mar 2019 22:38:04 GMT', 'ETag': 'W/\"97c1-5834c68e9f13b-gzip\"', 'Accept-Ranges': 'bytes', 'Vary': 'Accept-Encoding,User-Agent', 'Content-Encoding': 'gzip', 'Cache-Control': 'max-age=10', 'Expires': 'Mon, 04 Mar 2019 22:38:14 GMT', 'Content-Length': '8291', 'Keep-Alive': 'timeout=5, max=100', 'Connection': 'Keep-Alive', 'Content-Type': 'text/html; charset=UTF-8'}\n" ] } ], "source": [ "print(grad_page.headers)" ] }, { "cell_type": "code", "execution_count": 20, "metadata": {}, "outputs": [ { "data": { "text/plain": [ "'Sat, 23 Feb 2019 01:27:04 GMT'" ] }, "execution_count": 20, "metadata": {}, "output_type": "execute_result" } ], "source": [ "grad_page.headers['Date']" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Text of the webpage:" ] }, { "cell_type": "code", "execution_count": 3, "metadata": { "scrolled": true }, "outputs": [ { "data": { "text/plain": [ "'Graduate Students - Department of Mathematics - University at Buffalo
Loading menu...

Graduate Students

\"UB

Mathematics Graduate Student Directory 2017-2018

A | B | C | D | E | F | G | H | I | J | K | L | M | N | O | P | Q | R | S | T | U | V | W | X | Y | Z

A

Abeya Ranasinghe Mudiyanselage, Asela V.
Office: 129  Phone: 645-8817 
Email: aselavir@buffalo.edu

Alegria, Linda
Office: 138 Phone: 645-8823
Email: lindaale@buffalo.edu

B

Bittner, Alyson
Office: 130  Phone: 645-8818  
Email: alysonbi@buffalo.edu

C

Cain Charles
Office: 126 Phone: 645-8816
Email: ccain2@buffalo.edu

Casper, Michael
Office: 222  Phone: 645-8779
Email: mjcasper@buffalo.edu

Chang, Hong
Office: 136 Phone: 645-8821
Email: hchang24@buffalo.edu

Chen, Yen-Lin
Office: 125 Phone: 645-8815
Email: yenlinch@buffalo.edu

Cheuk, Ka Yue
Office: 140 Phone: 645-8825
Email: kayueche@buffalo.edu

Chou, Han-Jung
Office:140 Phone: 645-8825
Email: hanjungc@buffalo.edu

Cosgrove, Gage (Makenzie)
Office: 139 Phone: 645-8824
Email: gagecosg@buffalo.edu

D

Deutsch, Dustin
Office: 140  Phone: 645-8825
Email: dustinde@buffalo.edu

Dey, Subhankar
Office: 140  Phone: 645-8825
Email: subhanka@buffalo.edu

Doga, Hakan
Office: 126  Phone: 645-8816
Email: hakandog@buffalo.edu

Dong, Sai
Office: 138 Phone: 645-8823
Email: saidong@buffalo.edu

E

Engelhardt, Carolyn
Office: 132 Phone: 645-8820
Email: cengelha@buffalo.edu

G

Goswami, Abhishek
Office: 125 Phone: 645-8815
Email: goswami3@buffalo.edu

Gu, Jiaxi
Office: 132  Phone: 645-8820 
Email: jiaxigu@buffalo.edu

H

Hopfensperger, John
Office: 139  Phone: 645-8824
Email: johnhopf@buffalo.edu

Hung, Tsz Fun
Office: 137  Phone: 645-8822
Email: tszfunhu@buffalo.edu

Huynh, Bao
Office: 131 Phone: 645-8819
Email: baohuynh@buffalo.edu

J

Jeong, Myonggin
Office: 132 Phone: 645-8820
Email: myonggin@buffalo.edu

Jin, Xifeng
Office: 137  Phone: 645-8822
Email: xifengji@buffalo.edu

Johnson, Megan
Office: 132  Phone: 645-8820
Email: meganjoh@buffalo.edu

Jones, Raymond
Office: 140  Phone: 645-8825
Email: rpjones2@buffalo.edu

K

Kilic, Bengier Ulgen
Office: 125 Phone: 645-8815
Email: bengieru@buffalo.edu

Kim, Jiseong
Office: 125 Phone: 645-8815
Email: jiseongk@buffalo.edu

Kireyev, Dmitri
Office: 138  Phone: 645-8823
Email: dmitriki@buffalo.edu

L

Le, Minh Quang
Office: 131  Phone: 645-8819
Email: minhquan@buffalo.edu

Lin, Wei
Office: 313  Phone: 645-8827
Email: wlin27@buffalo.edu

Liu, Ruodan
Office: 140 Phone: 645-8825
Email: rliu8@buffalo.edu

Liu,Tianmou
Office: 136 Phone: 645-8821
Email: tianmoul@buffalo.edu

Liu, Yuan
Office: 140 Phone: 645-8825
Email: yuanliu@buffalo.edu

Lottes, Jonathan
Office: 129 Phone: 645-8817
Email: jllottes@buffalo.edu

 

M

Ma, Ning
Office: 125  Phone: 645-8815
Email: nma22@buffalo.edu

Ma, Renda
Office: 125  Phone: 645-8815
Email: rendama@buffalo.edu

Ma, Yuqing
Office: 138  Phone: 645-8823 
Email: yuqingma@buffalo.edu

Mandava, Navya
Office: 130  Phone: 645-8818
Email: navyatejam@gmail.com

Meng, Lingqi
Office: 130 Phone: 645-8818
Email: lingqime@buffalo.edu

Montoro, Michael
Office: 126  Phone: 645-8816
Email: mnmontor@buffalo.edu

N

O

Oregero, Jeffrey
Office: 129  Phone: 645-8817 
Email: jaoreger@buffalo.edu

P

Peng, Jun
Office: 139 Phone: 645-8824
Email: jpeng3@buffalo.edu

R

Raico, Pasquale
Office: 140 Phone: 645-8825
Email: ppraico@buffalo.edu

Rozwood, Bud
Office: 125 Phone: 645-8815
Email: budrozwo@buffalo.edu

S

Sailbery, Mitchell
Office: 126 Phone: 645-8816
Email: msailsbe@buffalo.edu

Samer, Andrew
Office: 130  Phone: 645-8818
Email: agsamer@buffalo.edu

Schilder, Marc
Office: 313 Phone: 645-8827
Email: mbschild@buffalo.edu

Schmidt, Matthew
Office: 129  Phone: 645-8817 
Email: mwschmid@buffalo.edu

Shi, Ruonan
Office: 139  Phone: 645-8824
Email: ruonansh@buffalo.edu

Solanski, Deepisha
Office: 140 Phone: 645-8825
Email: deepisha@buffalo.edu

Song, Zhao
Office: 131 Phone: 645-8819
Email: zhaosong@buffalo.edu

Sullivan, Mark
Office: 136  Phone: 645-8821
Email: marksull@buffalo.edu

Sun, Yuxun
Office: 140 Phone: 645-8825
Email: yuxunsun@buffalo.edu

U

Upadhyay, Aparna
Office: 136  Phone: 645-8821
Email: aparnaup@buffalo.edu

W

Wang, Biao
Office: 125 Phone: 645-8815
Email: bwang32@buffalo.edu

Wang, Daxun
Office: 141 Phone: 645-8825
Email: daxunwan@buffalo.edu

Wang, Shiruo
Office: 125 Phone: 645-8815
Email: shiruo@buffalo.edu

Wang, Weiqi
Office: 125 Phone: 645-8815
Email: weiqiwan@buffalo.edu

Winton, Daniel
Office: 125 Phone: 645-8815
Email: dmwinton@buffalo.edu

Y

Yuan, Cheng
Office: 137  Phone: 645-8822
Email: chengyua@buffalo.edu

Z

Zalloum, Abdalrazzaq R.A.
Personal Website
Office: 140  Phone: 645-8825 
Email: abdalraz@buffalo.edu

Zeng, Naijiang
Office: 137  Phone: 645-8822
Email: naijiang@buffalo.edu

Zhou, Chen
Office: 137  Phone: 645-8822
Email: czhou5@buffalo.edu

Ziegler, Cameron
Office: 140  Phone: 645-8825
Email: cz22@buffalo.edu

\"University

244 Mathematics Building
Buffalo, NY 14260-2900
Phone: (716) 645-6284
Fax: (716) 645-5039

 University at Buffalo. All rights reserved.  |  Privacy  |  Accessibility
'" ] }, "execution_count": 3, "metadata": {}, "output_type": "execute_result" } ], "source": [ "grads = grad_page.text\n", "grads" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "
\n", "

Retrieving binary files

\n", "\n", "
\n", "A sample jpeg image from Wikipedia Commons:\n", "\n", "
\n",
    "https://bit.ly/2V8pGeS\n",
    "
\n", "\n", "
" ] }, { "cell_type": "code", "execution_count": 6, "metadata": {}, "outputs": [], "source": [ "r = requests.get(\"https://bit.ly/2V8pGeS\")\n", "bird = r.content" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Save content to a file:" ] }, { "cell_type": "code", "execution_count": 7, "metadata": {}, "outputs": [], "source": [ "foo = open(\"bird.jpg\", \"wb\") # open a file w=writing access, b=binary\n", "foo.write(bird) # write to the file \n", "foo.close() # close file" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "
\n", "go top\n", "\n", "

\n", "3. Beautiful Soup\n", "

\n", "\n", "\n", "
\n", "\n", "Beautiful Soup is a Python library for parsing HTML documents. \n", "\n", "
" ] }, { "cell_type": "code", "execution_count": 5, "metadata": {}, "outputs": [], "source": [ "from bs4 import BeautifulSoup" ] }, { "cell_type": "code", "execution_count": 7, "metadata": { "scrolled": true }, "outputs": [ { "data": { "text/plain": [ "'Graduate Students - Department of Mathematics - University at Buffalo
Loading menu...

Graduate Students

\"UB

Mathematics Graduate Student Directory 2017-2018

A | B | C | D | E | F | G | H | I | J | K | L | M | N | O | P | Q | R | S | T | U | V | W | X | Y | Z

A

Abeya Ranasinghe Mudiyanselage, Asela V.
Office: 129  Phone: 645-8817 
Email: aselavir@buffalo.edu

Alegria, Linda
Office: 138 Phone: 645-8823
Email: lindaale@buffalo.edu

B

Bittner, Alyson
Office: 130  Phone: 645-8818  
Email: alysonbi@buffalo.edu

C

Cain Charles
Office: 126 Phone: 645-8816
Email: ccain2@buffalo.edu

Casper, Michael
Office: 222  Phone: 645-8779
Email: mjcasper@buffalo.edu

Chang, Hong
Office: 136 Phone: 645-8821
Email: hchang24@buffalo.edu

Chen, Yen-Lin
Office: 125 Phone: 645-8815
Email: yenlinch@buffalo.edu

Cheuk, Ka Yue
Office: 140 Phone: 645-8825
Email: kayueche@buffalo.edu

Chou, Han-Jung
Office:140 Phone: 645-8825
Email: hanjungc@buffalo.edu

Cosgrove, Gage (Makenzie)
Office: 139 Phone: 645-8824
Email: gagecosg@buffalo.edu

D

Deutsch, Dustin
Office: 140  Phone: 645-8825
Email: dustinde@buffalo.edu

Dey, Subhankar
Office: 140  Phone: 645-8825
Email: subhanka@buffalo.edu

Doga, Hakan
Office: 126  Phone: 645-8816
Email: hakandog@buffalo.edu

Dong, Sai
Office: 138 Phone: 645-8823
Email: saidong@buffalo.edu

E

Engelhardt, Carolyn
Office: 132 Phone: 645-8820
Email: cengelha@buffalo.edu

G

Goswami, Abhishek
Office: 125 Phone: 645-8815
Email: goswami3@buffalo.edu

Gu, Jiaxi
Office: 132  Phone: 645-8820 
Email: jiaxigu@buffalo.edu

H

Hopfensperger, John
Office: 139  Phone: 645-8824
Email: johnhopf@buffalo.edu

Hung, Tsz Fun
Office: 137  Phone: 645-8822
Email: tszfunhu@buffalo.edu

Huynh, Bao
Office: 131 Phone: 645-8819
Email: baohuynh@buffalo.edu

J

Jeong, Myonggin
Office: 132 Phone: 645-8820
Email: myonggin@buffalo.edu

Jin, Xifeng
Office: 137  Phone: 645-8822
Email: xifengji@buffalo.edu

Johnson, Megan
Office: 132  Phone: 645-8820
Email: meganjoh@buffalo.edu

Jones, Raymond
Office: 140  Phone: 645-8825
Email: rpjones2@buffalo.edu

K

Kilic, Bengier Ulgen
Office: 125 Phone: 645-8815
Email: bengieru@buffalo.edu

Kim, Jiseong
Office: 125 Phone: 645-8815
Email: jiseongk@buffalo.edu

Kireyev, Dmitri
Office: 138  Phone: 645-8823
Email: dmitriki@buffalo.edu

L

Le, Minh Quang
Office: 131  Phone: 645-8819
Email: minhquan@buffalo.edu

Lin, Wei
Office: 313  Phone: 645-8827
Email: wlin27@buffalo.edu

Liu, Ruodan
Office: 140 Phone: 645-8825
Email: rliu8@buffalo.edu

Liu,Tianmou
Office: 136 Phone: 645-8821
Email: tianmoul@buffalo.edu

Liu, Yuan
Office: 140 Phone: 645-8825
Email: yuanliu@buffalo.edu

Lottes, Jonathan
Office: 129 Phone: 645-8817
Email: jllottes@buffalo.edu

 

M

Ma, Ning
Office: 125  Phone: 645-8815
Email: nma22@buffalo.edu

Ma, Renda
Office: 125  Phone: 645-8815
Email: rendama@buffalo.edu

Ma, Yuqing
Office: 138  Phone: 645-8823 
Email: yuqingma@buffalo.edu

Mandava, Navya
Office: 130  Phone: 645-8818
Email: navyatejam@gmail.com

Meng, Lingqi
Office: 130 Phone: 645-8818
Email: lingqime@buffalo.edu

Montoro, Michael
Office: 126  Phone: 645-8816
Email: mnmontor@buffalo.edu

N

O

Oregero, Jeffrey
Office: 129  Phone: 645-8817 
Email: jaoreger@buffalo.edu

P

Peng, Jun
Office: 139 Phone: 645-8824
Email: jpeng3@buffalo.edu

R

Raico, Pasquale
Office: 140 Phone: 645-8825
Email: ppraico@buffalo.edu

Rozwood, Bud
Office: 125 Phone: 645-8815
Email: budrozwo@buffalo.edu

S

Sailbery, Mitchell
Office: 126 Phone: 645-8816
Email: msailsbe@buffalo.edu

Samer, Andrew
Office: 130  Phone: 645-8818
Email: agsamer@buffalo.edu

Schilder, Marc
Office: 313 Phone: 645-8827
Email: mbschild@buffalo.edu

Schmidt, Matthew
Office: 129  Phone: 645-8817 
Email: mwschmid@buffalo.edu

Shi, Ruonan
Office: 139  Phone: 645-8824
Email: ruonansh@buffalo.edu

Solanski, Deepisha
Office: 140 Phone: 645-8825
Email: deepisha@buffalo.edu

Song, Zhao
Office: 131 Phone: 645-8819
Email: zhaosong@buffalo.edu

Sullivan, Mark
Office: 136  Phone: 645-8821
Email: marksull@buffalo.edu

Sun, Yuxun
Office: 140 Phone: 645-8825
Email: yuxunsun@buffalo.edu

U

Upadhyay, Aparna
Office: 136  Phone: 645-8821
Email: aparnaup@buffalo.edu

W

Wang, Biao
Office: 125 Phone: 645-8815
Email: bwang32@buffalo.edu

Wang, Daxun
Office: 141 Phone: 645-8825
Email: daxunwan@buffalo.edu

Wang, Shiruo
Office: 125 Phone: 645-8815
Email: shiruo@buffalo.edu

Wang, Weiqi
Office: 125 Phone: 645-8815
Email: weiqiwan@buffalo.edu

Winton, Daniel
Office: 125 Phone: 645-8815
Email: dmwinton@buffalo.edu

Y

Yuan, Cheng
Office: 137  Phone: 645-8822
Email: chengyua@buffalo.edu

Z

Zalloum, Abdalrazzaq R.A.
Personal Website
Office: 140  Phone: 645-8825 
Email: abdalraz@buffalo.edu

Zeng, Naijiang
Office: 137  Phone: 645-8822
Email: naijiang@buffalo.edu

Zhou, Chen
Office: 137  Phone: 645-8822
Email: czhou5@buffalo.edu

Ziegler, Cameron
Office: 140  Phone: 645-8825
Email: cz22@buffalo.edu

\"University

244 Mathematics Building
Buffalo, NY 14260-2900
Phone: (716) 645-6284
Fax: (716) 645-5039

 University at Buffalo. All rights reserved.  |  Privacy  |  Accessibility
'" ] }, "execution_count": 7, "metadata": {}, "output_type": "execute_result" } ], "source": [ "grads" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "
\n", "Parse the page:\n", "
" ] }, { "cell_type": "code", "execution_count": 6, "metadata": {}, "outputs": [], "source": [ "soup = BeautifulSoup(grads)" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "
\n", "Print the page code neatly:\n", "
" ] }, { "cell_type": "code", "execution_count": 7, "metadata": { "scrolled": true }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "\n", "\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " Graduate Students - Department of Mathematics - University at Buffalo\n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", " \n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", " \n", "
\n", "
\n", " \n", "
\n", "
\n", " \n", " \n", "
\n", "
\n", " \n", "
\n", "
\n", "
\n", " \n", " \n", " \n", "
\n", "
\n", "
\n", "
\n", "
\n", " \n", " \n", " \n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", " \n", " Loading menu...\n", " \n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", " \n", "
\n", "
\n", " \n", " \n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", " \n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", " \n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "

\n", " Graduate Students\n", "

\n", "
\n", "
\n", "
\n", " \n", " \n", " \n", " \"UB\n", " \n", " \n", " \n", "
\n", "
\n", "
\n", "

\n", " Mathematics Graduate Student Directory 2017-2018\n", "

\n", "
\n", "
\n", "

\n", " \n", " A\n", " \n", " |\n", " \n", " B\n", " \n", " |\n", " \n", " C\n", " \n", " |\n", " \n", " D\n", " \n", " | E |\n", " \n", " F\n", " \n", " |\n", " \n", " G\n", " \n", " |\n", " \n", " H\n", " \n", " | I |\n", " \n", " J\n", " \n", " |\n", " \n", " K\n", " \n", " |\n", " \n", " L\n", " \n", " |\n", " \n", " M\n", " \n", " |\n", " \n", " N\n", " \n", " |\n", " \n", " O\n", " \n", " |\n", " \n", " P\n", " \n", " | Q |\n", " \n", " R\n", " \n", " |\n", " \n", " S\n", " \n", " |\n", " \n", " T\n", " \n", " |\n", " \n", " U\n", " \n", " |\n", " \n", " V\n", " \n", " |\n", " \n", " W\n", " \n", " |\n", " \n", " X\n", " \n", " |\n", " \n", " Y\n", " \n", " |\n", " \n", " Z\n", " \n", "

\n", "
\n", "
\n", "
\n", "
\n", "

\n", " A\n", "

\n", "
\n", "
\n", "

\n", " \n", " Abeya Ranasinghe Mudiyanselage, Asela V.\n", " \n", "
\n", " Office: 129  Phone: 645-8817\n", "
\n", " Email: aselavir@buffalo.edu\n", "
\n", "

\n", "

\n", " \n", " Alegria, Linda\n", " \n", "
\n", " Office: 138 Phone: 645-8823\n", "
\n", " Email: lindaale@buffalo.edu\n", "
\n", "

\n", "
\n", "
\n", "
\n", "
\n", "

\n", " B\n", "

\n", "
\n", "
\n", "

\n", " \n", " Bittner, Alyson\n", " \n", "
\n", " Office: 130  Phone: 645-8818\n", "
\n", " Email: alysonbi@buffalo.edu\n", "

\n", "
\n", " \n", "
\n", "
\n", "
\n", "

\n", " C\n", "

\n", "
\n", "
\n", "

\n", " \n", " Cain Charles\n", " \n", "
\n", " Office: 126 Phone: 645-8816\n", "
\n", " Email: ccain2@buffalo.edu\n", "

\n", "

\n", " \n", " Casper, Michael\n", "
\n", "
\n", " Office: 222  Phone: 645-8779\n", "
\n", " Email: mjcasper@buffalo.edu\n", "

\n", "

\n", " \n", " Chang, Hong\n", "
\n", "
\n", " Office: 136 Phone: 645-8821\n", "
\n", " Email: hchang24@buffalo.edu\n", "

\n", "

\n", " \n", " Chen, Yen-Lin\n", "
\n", "
\n", " Office: 125 Phone: 645-8815\n", "
\n", " Email: yenlinch@buffalo.edu\n", "
\n", "

\n", "

\n", " \n", " Cheuk, Ka Yue\n", "
\n", "
\n", " Office: 140 Phone: 645-8825\n", "
\n", " Email: kayueche@buffalo.edu\n", "

\n", "

\n", " \n", " Chou, Han-Jung\n", "
\n", "
\n", " Office:140 Phone: 645-8825\n", "
\n", " Email: hanjungc@buffalo.edu\n", "

\n", "

\n", " \n", " Cosgrove, Gage (Makenzie)\n", "
\n", "
\n", " Office: 139 Phone: 645-8824\n", "
\n", " Email: gagecosg@buffalo.edu\n", "

\n", "
\n", " \n", "
\n", "
\n", "
\n", "

\n", " D\n", "

\n", "
\n", "
\n", "

\n", " \n", " Deutsch, Dustin\n", "
\n", "
\n", " Office: 140  Phone: 645-8825\n", "
\n", " Email: dustinde@buffalo.edu\n", "

\n", "

\n", " \n", " Dey, Subhankar\n", " \n", "
\n", " Office: 140  Phone: 645-8825\n", "
\n", " Email: subhanka@buffalo.edu\n", "

\n", "

\n", " \n", " Doga, Hakan\n", " \n", "
\n", " Office: 126  Phone: 645-8816\n", "
\n", " Email: hakandog@buffalo.edu\n", "

\n", "

\n", " \n", " Dong, Sai\n", "
\n", "
\n", " Office: 138 Phone: 645-8823\n", "
\n", " Email: saidong@buffalo.edu\n", "

\n", "
\n", " \n", "
\n", "

\n", " E\n", "

\n", "
\n", "
\n", "

\n", " \n", " Engelhardt, Carolyn\n", "
\n", "
\n", " Office: 132 Phone: 645-8820\n", "
\n", " Email: cengelha@buffalo.edu\n", "
\n", "

\n", "
\n", "
\n", "

\n", " G\n", "

\n", "
\n", "
\n", "

\n", " \n", " Goswami, Abhishek\n", "
\n", "
\n", " Office: 125 Phone: 645-8815\n", "
\n", " Email: goswami3@buffalo.edu\n", "

\n", "

\n", " \n", " Gu, Jiaxi\n", " \n", "
\n", " Office: 132  Phone: 645-8820\n", "
\n", " Email: jiaxigu@buffalo.edu\n", "

\n", "
\n", "
\n", "

\n", " H\n", "

\n", "
\n", "
\n", "

\n", " \n", " Hopfensperger, John\n", " \n", "
\n", " Office: 139  Phone: 645-8824\n", "
\n", " Email: johnhopf@buffalo.edu\n", "

\n", "

\n", " \n", " Hung, Tsz Fun\n", " \n", "
\n", " Office: 137  Phone: 645-8822\n", "
\n", " Email: tszfunhu@buffalo.edu\n", "

\n", "

\n", " \n", " Huynh, Bao\n", "
\n", "
\n", " Office: 131 Phone: 645-8819\n", "
\n", " Email: baohuynh@buffalo.edu\n", "

\n", "
\n", " \n", "
\n", "

\n", " J\n", "

\n", "
\n", "
\n", "

\n", " \n", " Jeong, Myonggin\n", "
\n", "
\n", " Office: 132 Phone: 645-8820\n", "
\n", " Email: myonggin@buffalo.edu\n", "
\n", "

\n", "

\n", " \n", " Jin, Xifeng\n", " \n", "
\n", " Office: 137  Phone: 645-8822\n", "
\n", " Email: xifengji@buffalo.edu\n", "

\n", "

\n", " \n", " Johnson, Megan\n", " \n", "
\n", " Office: 132  Phone: 645-8820\n", "
\n", " Email: meganjoh@buffalo.edu\n", "

\n", "

\n", " \n", " Jones, Raymond\n", " \n", "
\n", " Office: 140  Phone: 645-8825\n", "
\n", " Email: rpjones2@buffalo.edu\n", "

\n", "
\n", "
\n", "

\n", " K\n", "

\n", "
\n", "
\n", "

\n", " \n", " Kilic, Bengier Ulgen\n", "
\n", "
\n", " Office: 125 Phone: 645-8815\n", "
\n", " Email: bengieru@buffalo.edu\n", "

\n", "

\n", " \n", " Kim, Jiseong\n", "
\n", "
\n", " Office: 125 Phone: 645-8815\n", "
\n", " Email: jiseongk@buffalo.edu\n", "

\n", "

\n", " \n", " Kireyev, Dmitri\n", " \n", "
\n", " Office: 138  Phone: 645-8823\n", "
\n", " Email: dmitriki@buffalo.edu\n", "
\n", "

\n", "
\n", " \n", "
\n", "

\n", " L\n", "

\n", "
\n", "
\n", "

\n", " \n", " Le, Minh Quang\n", " \n", "
\n", " Office: 131  Phone: 645-8819\n", "
\n", " Email: minhquan@buffalo.edu\n", "
\n", "

\n", "

\n", " \n", " Lin, Wei\n", " \n", "
\n", " Office: 313  Phone: 645-8827\n", "
\n", " Email: wlin27@buffalo.edu\n", "
\n", "

\n", "

\n", " \n", " Liu, Ruodan\n", " \n", "
\n", " Office: 140 Phone: 645-8825\n", "
\n", " Email: rliu8@buffalo.edu\n", "

\n", "

\n", " \n", " Liu,Tianmou\n", "
\n", "
\n", " Office: 136 Phone: 645-8821\n", "
\n", " Email: tianmoul@buffalo.edu\n", "

\n", "

\n", " \n", " Liu, Yuan\n", " \n", "
\n", " Office: 140 Phone: 645-8825\n", "
\n", " Email: yuanliu@buffalo.edu\n", "

\n", "

\n", " \n", " Lottes, Jonathan\n", " \n", "
\n", " Office: 129 Phone: 645-8817\n", "
\n", " Email: jllottes@buffalo.edu\n", "

\n", "

\n", "

\n", "
\n", " \n", "
\n", "

\n", " M\n", "

\n", "
\n", "
\n", "

\n", " \n", " Ma, Ning\n", "
\n", "
\n", " Office: 125  Phone: 645-8815\n", "
\n", " Email: nma22@buffalo.edu\n", "
\n", "

\n", "

\n", " \n", " Ma, Renda\n", "
\n", "
\n", " Office: 125  Phone: 645-8815\n", "
\n", " Email: rendama@buffalo.edu\n", "

\n", "

\n", " \n", " Ma, Yuqing\n", " \n", "
\n", " Office: 138  Phone: 645-8823\n", "
\n", " Email: yuqingma@buffalo.edu\n", "

\n", "

\n", " \n", " Mandava, Navya\n", "
\n", "
\n", " Office: 130  Phone: 645-8818\n", "
\n", " Email: navyatejam@gmail.com\n", "

\n", "

\n", " \n", " Meng, Lingqi\n", "
\n", "
\n", " Office: 130 Phone: 645-8818\n", "
\n", " Email: lingqime@buffalo.edu\n", "

\n", "

\n", " \n", " Montoro, Michael\n", "
\n", "
\n", " Office: 126  Phone: 645-8816\n", "
\n", " Email: mnmontor@buffalo.edu\n", "
\n", "

\n", "
\n", "
\n", "

\n", " N\n", "

\n", "
\n", "
\n", "
\n", " \n", "
\n", "

\n", " O\n", "

\n", "
\n", "
\n", "

\n", " \n", " Oregero, Jeffrey\n", " \n", "
\n", " Office: 129  Phone: 645-8817\n", "
\n", " Email: jaoreger@buffalo.edu\n", "

\n", "
\n", "
\n", "

\n", " P\n", "

\n", "
\n", "
\n", "

\n", " \n", " Peng, Jun\n", " \n", "
\n", " Office: 139 Phone: 645-8824\n", " \n", "
\n", "
\n", " Email: jpeng3@buffalo.edu\n", "
\n", "

\n", "
\n", "
\n", "

\n", " R\n", "

\n", "
\n", "
\n", "

\n", " \n", " Raico, Pasquale\n", "
\n", "
\n", " Office: 140 Phone: 645-8825\n", "
\n", " Email: ppraico@buffalo.edu\n", "

\n", "

\n", " \n", " Rozwood, Bud\n", "
\n", "
\n", " Office: 125 Phone: 645-8815\n", " \n", "
\n", "
\n", " Email: budrozwo@buffalo.edu\n", "

\n", "
\n", "
\n", "

\n", " S\n", "

\n", "
\n", "
\n", "

\n", " \n", " Sailbery, Mitchell\n", " \n", "
\n", " Office: 126 Phone: 645-8816\n", "
\n", " Email: msailsbe@buffalo.edu\n", "

\n", "

\n", " \n", " Samer, Andrew\n", "
\n", "
\n", " Office: 130  Phone: 645-8818\n", "
\n", " Email: agsamer@buffalo.edu\n", "

\n", "

\n", " \n", " Schilder, Marc\n", " \n", "
\n", " Office: 313 Phone: 645-8827\n", "
\n", " Email: mbschild@buffalo.edu\n", "

\n", "

\n", " \n", " Schmidt, Matthew\n", " \n", "
\n", " Office: 129  Phone: 645-8817\n", "
\n", " Email: mwschmid@buffalo.edu\n", "

\n", "

\n", " \n", " Shi, Ruonan\n", " \n", "
\n", " Office: 139  Phone: 645-8824\n", "
\n", " Email: ruonansh@buffalo.edu\n", "

\n", "

\n", " \n", " Solanski, Deepisha\n", " \n", "
\n", " Office: 140 Phone: 645-8825\n", "
\n", " Email: deepisha@buffalo.edu\n", "

\n", "

\n", " \n", " Song, Zhao\n", "
\n", "
\n", " Office: 131 Phone: 645-8819\n", "
\n", " Email: zhaosong@buffalo.edu\n", "

\n", "

\n", " \n", " Sullivan, Mark\n", " \n", "
\n", " Office: 136  Phone: 645-8821\n", "
\n", " Email: marksull@buffalo.edu\n", "

\n", "

\n", " \n", " Sun, Yuxun\n", " \n", "
\n", " Office: 140 Phone: 645-8825\n", "
\n", " Email: yuxunsun@buffalo.edu\n", "

\n", "
\n", " \n", "
\n", "

\n", " U\n", "

\n", "
\n", "
\n", "

\n", " \n", " Upadhyay, Aparna\n", " \n", "
\n", " Office: 136  Phone: 645-8821\n", "
\n", " Email: aparnaup@buffalo.edu\n", "

\n", "
\n", "
\n", "

\n", " W\n", "

\n", "
\n", "
\n", "

\n", " \n", " Wang, Biao\n", "
\n", "
\n", " Office: 125 Phone: 645-8815\n", "
\n", " Email: bwang32@buffalo.edu\n", "

\n", "

\n", " \n", " Wang, Daxun\n", "
\n", "
\n", " Office: 141 Phone: 645-8825\n", "
\n", " Email: daxunwan@buffalo.edu\n", "

\n", "

\n", " \n", " Wang, Shiruo\n", "
\n", "
\n", " Office: 125 Phone: 645-8815\n", "
\n", " Email: shiruo@buffalo.edu\n", "

\n", "

\n", " \n", " Wang, Weiqi\n", "
\n", "
\n", " Office: 125 Phone: 645-8815\n", "
\n", " Email: weiqiwan@buffalo.edu\n", "

\n", "

\n", " \n", " Winton, Daniel\n", "
\n", "
\n", " Office: 125 Phone: 645-8815\n", "
\n", " Email: dmwinton@buffalo.edu\n", "
\n", "

\n", "
\n", " \n", "
\n", "

\n", " Y\n", "

\n", "
\n", "
\n", "

\n", " \n", " Yuan, Cheng\n", " \n", "
\n", " Office: 137  Phone: 645-8822\n", "
\n", " Email: chengyua@buffalo.edu\n", "

\n", "
\n", "
\n", "

\n", " Z\n", "

\n", "
\n", "
\n", "

\n", " \n", " Zalloum, Abdalrazzaq R.A.\n", "
\n", "
\n", " \n", " Personal Website\n", " \n", "
\n", " Office: 140  Phone: 645-8825\n", "
\n", " Email: abdalraz@buffalo.edu\n", "

\n", "

\n", " \n", " Zeng, Naijiang\n", " \n", "
\n", " Office: 137  Phone: 645-8822\n", "
\n", " Email: naijiang@buffalo.edu\n", "

\n", "

\n", " \n", " Zhou, Chen\n", " \n", "
\n", " Office: 137  Phone: 645-8822\n", "
\n", " Email: czhou5@buffalo.edu\n", "

\n", "

\n", " \n", " Ziegler, Cameron\n", " \n", "
\n", " Office: 140  Phone: 645-8825\n", "
\n", " Email: cz22@buffalo.edu\n", "

\n", "
\n", " \n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", " \n", "
\n", "
\n", "
\n", "
\n", " \n", " \"University\n", " \n", "
\n", "
\n", "
\n", "
\n", "

\n", " \n", " Department of Mathematics\n", " \n", "

\n", "
\n", "
\n", "

\n", " 244 Mathematics Building\n", "
\n", " Buffalo, NY 14260-2900\n", "
\n", " Phone: (716) 645-6284\n", "
\n", " Fax: (716) 645-5039\n", "

\n", "
\n", "
\n", "
\n", "
\n", "
\n", " \n", " \n", " \n", " \n", " University at Buffalo\n", " \n", " . All rights reserved.  |\n", " \n", " Privacy\n", " \n", " |\n", " \n", " Accessibility\n", " \n", "
\n", "
\n", "
\n", "
\n", "
\n", " \n", " \n", " \n", "
\n", "
\n", "
\n", "
\n", "
\n", "
\n", " \n", "\n" ] } ], "source": [ "print(soup.prettify())" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "
\n", "Find all HTML <p>...</p> tags:\n", "
" ] }, { "cell_type": "code", "execution_count": 10, "metadata": { "scrolled": true }, "outputs": [ { "data": { "text/plain": [ "[

A | B | C | D | E | F | G | H | I | J | K | L | M | N | O | P | Q | R | S | T | U | V | W | X | Y | Z

,\n", "

Abeya Ranasinghe Mudiyanselage, Asela V.
Office: 129  Phone: 645-8817 
Email: aselavir@buffalo.edu

,\n", "

Alegria, Linda
Office: 138 Phone: 645-8823
Email: lindaale@buffalo.edu

,\n", "

Bittner, Alyson
Office: 130  Phone: 645-8818  
Email: alysonbi@buffalo.edu

,\n", "

Cain Charles
Office: 126 Phone: 645-8816
Email: ccain2@buffalo.edu

,\n", "

Casper, Michael
Office: 222  Phone: 645-8779
Email: mjcasper@buffalo.edu

,\n", "

Chang, Hong
Office: 136 Phone: 645-8821
Email: hchang24@buffalo.edu

,\n", "

Chen, Yen-Lin
Office: 125 Phone: 645-8815
Email: yenlinch@buffalo.edu

,\n", "

Cheuk, Ka Yue
Office: 140 Phone: 645-8825
Email: kayueche@buffalo.edu

,\n", "

Chou, Han-Jung
Office:140 Phone: 645-8825
Email: hanjungc@buffalo.edu

,\n", "

Cosgrove, Gage (Makenzie)
Office: 139 Phone: 645-8824
Email: gagecosg@buffalo.edu

,\n", "

Deutsch, Dustin
Office: 140  Phone: 645-8825
Email: dustinde@buffalo.edu

,\n", "

Dey, Subhankar
Office: 140  Phone: 645-8825
Email: subhanka@buffalo.edu

,\n", "

Doga, Hakan
Office: 126  Phone: 645-8816
Email: hakandog@buffalo.edu

,\n", "

Dong, Sai
Office: 138 Phone: 645-8823
Email: saidong@buffalo.edu

,\n", "

Engelhardt, Carolyn
Office: 132 Phone: 645-8820
Email: cengelha@buffalo.edu

,\n", "

Goswami, Abhishek
Office: 125 Phone: 645-8815
Email: goswami3@buffalo.edu

,\n", "

Gu, Jiaxi
Office: 132  Phone: 645-8820 
Email: jiaxigu@buffalo.edu

,\n", "

Hopfensperger, John
Office: 139  Phone: 645-8824
Email: johnhopf@buffalo.edu

,\n", "

Hung, Tsz Fun
Office: 137  Phone: 645-8822
Email: tszfunhu@buffalo.edu

,\n", "

Huynh, Bao
Office: 131 Phone: 645-8819
Email: baohuynh@buffalo.edu

,\n", "

Jeong, Myonggin
Office: 132 Phone: 645-8820
Email: myonggin@buffalo.edu

,\n", "

Jin, Xifeng
Office: 137  Phone: 645-8822
Email: xifengji@buffalo.edu

,\n", "

Johnson, Megan
Office: 132  Phone: 645-8820
Email: meganjoh@buffalo.edu

,\n", "

Jones, Raymond
Office: 140  Phone: 645-8825
Email: rpjones2@buffalo.edu

,\n", "

Kilic, Bengier Ulgen
Office: 125 Phone: 645-8815
Email: bengieru@buffalo.edu

,\n", "

Kim, Jiseong
Office: 125 Phone: 645-8815
Email: jiseongk@buffalo.edu

,\n", "

Kireyev, Dmitri
Office: 138  Phone: 645-8823
Email: dmitriki@buffalo.edu

,\n", "

Le, Minh Quang
Office: 131  Phone: 645-8819
Email: minhquan@buffalo.edu

,\n", "

Lin, Wei
Office: 313  Phone: 645-8827
Email: wlin27@buffalo.edu

,\n", "

Liu, Ruodan
Office: 140 Phone: 645-8825
Email: rliu8@buffalo.edu

,\n", "

Liu,Tianmou
Office: 136 Phone: 645-8821
Email: tianmoul@buffalo.edu

,\n", "

Liu, Yuan
Office: 140 Phone: 645-8825
Email: yuanliu@buffalo.edu

,\n", "

Lottes, Jonathan
Office: 129 Phone: 645-8817
Email: jllottes@buffalo.edu

,\n", "

 

,\n", "

Ma, Ning
Office: 125  Phone: 645-8815
Email: nma22@buffalo.edu

,\n", "

Ma, Renda
Office: 125  Phone: 645-8815
Email: rendama@buffalo.edu

,\n", "

Ma, Yuqing
Office: 138  Phone: 645-8823 
Email: yuqingma@buffalo.edu

,\n", "

Mandava, Navya
Office: 130  Phone: 645-8818
Email: navyatejam@gmail.com

,\n", "

Meng, Lingqi
Office: 130 Phone: 645-8818
Email: lingqime@buffalo.edu

,\n", "

Montoro, Michael
Office: 126  Phone: 645-8816
Email: mnmontor@buffalo.edu

,\n", "

Oregero, Jeffrey
Office: 129  Phone: 645-8817 
Email: jaoreger@buffalo.edu

,\n", "

Peng, Jun
Office: 139 Phone: 645-8824
Email: jpeng3@buffalo.edu

,\n", "

Raico, Pasquale
Office: 140 Phone: 645-8825
Email: ppraico@buffalo.edu

,\n", "

Rozwood, Bud
Office: 125 Phone: 645-8815
Email: budrozwo@buffalo.edu

,\n", "

Sailbery, Mitchell
Office: 126 Phone: 645-8816
Email: msailsbe@buffalo.edu

,\n", "

Samer, Andrew
Office: 130  Phone: 645-8818
Email: agsamer@buffalo.edu

,\n", "

Schilder, Marc
Office: 313 Phone: 645-8827
Email: mbschild@buffalo.edu

,\n", "

Schmidt, Matthew
Office: 129  Phone: 645-8817 
Email: mwschmid@buffalo.edu

,\n", "

Shi, Ruonan
Office: 139  Phone: 645-8824
Email: ruonansh@buffalo.edu

,\n", "

Solanski, Deepisha
Office: 140 Phone: 645-8825
Email: deepisha@buffalo.edu

,\n", "

Song, Zhao
Office: 131 Phone: 645-8819
Email: zhaosong@buffalo.edu

,\n", "

Sullivan, Mark
Office: 136  Phone: 645-8821
Email: marksull@buffalo.edu

,\n", "

Sun, Yuxun
Office: 140 Phone: 645-8825
Email: yuxunsun@buffalo.edu

,\n", "

Upadhyay, Aparna
Office: 136  Phone: 645-8821
Email: aparnaup@buffalo.edu

,\n", "

Wang, Biao
Office: 125 Phone: 645-8815
Email: bwang32@buffalo.edu

,\n", "

Wang, Daxun
Office: 141 Phone: 645-8825
Email: daxunwan@buffalo.edu

,\n", "

Wang, Shiruo
Office: 125 Phone: 645-8815
Email: shiruo@buffalo.edu

,\n", "

Wang, Weiqi
Office: 125 Phone: 645-8815
Email: weiqiwan@buffalo.edu

,\n", "

Winton, Daniel
Office: 125 Phone: 645-8815
Email: dmwinton@buffalo.edu

,\n", "

Yuan, Cheng
Office: 137  Phone: 645-8822
Email: chengyua@buffalo.edu

,\n", "

Zalloum, Abdalrazzaq R.A.
Personal Website
Office: 140  Phone: 645-8825 
Email: abdalraz@buffalo.edu

,\n", "

Zeng, Naijiang
Office: 137  Phone: 645-8822
Email: naijiang@buffalo.edu

,\n", "

Zhou, Chen
Office: 137  Phone: 645-8822
Email: czhou5@buffalo.edu

,\n", "

Ziegler, Cameron
Office: 140  Phone: 645-8825
Email: cz22@buffalo.edu

,\n", "

244 Mathematics Building
Buffalo, NY 14260-2900
Phone: (716) 645-6284
Fax: (716) 645-5039

]" ] }, "execution_count": 10, "metadata": {}, "output_type": "execute_result" } ], "source": [ "ptags = soup.find_all('p')\n", "ptags" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "
\n", "\n", "

The get_text() method returns text enclosed in a tag.

\n", "
" ] }, { "cell_type": "code", "execution_count": 13, "metadata": {}, "outputs": [ { "data": { "text/plain": [ "

Abeya Ranasinghe Mudiyanselage, Asela V.
Office: 129  Phone: 645-8817 
Email: aselavir@buffalo.edu

" ] }, "execution_count": 13, "metadata": {}, "output_type": "execute_result" } ], "source": [ "ptags[1]" ] }, { "cell_type": "code", "execution_count": 14, "metadata": {}, "outputs": [ { "data": { "text/plain": [ "'Abeya Ranasinghe Mudiyanselage, Asela V. Office: 129\\xa0 Phone: 645-8817\\xa0 Email: aselavir@buffalo.edu'" ] }, "execution_count": 14, "metadata": {}, "output_type": "execute_result" } ], "source": [ "ptags[1].get_text()" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "
\n", "Get text from all paragraphs containing the word \"Office\":\n", "
" ] }, { "cell_type": "code", "execution_count": 11, "metadata": {}, "outputs": [ { "data": { "text/plain": [ "['Abeya Ranasinghe Mudiyanselage, Asela V. Office: 129\\xa0 Phone: 645-8817\\xa0 Email: aselavir@buffalo.edu',\n", " 'Alegria, Linda Office: 138 Phone: 645-8823 Email: lindaale@buffalo.edu',\n", " 'Bittner, Alyson Office: 130\\xa0\\xa0Phone: 645-8818\\xa0\\xa0 Email: alysonbi@buffalo.edu',\n", " 'Cain Charles Office: 126 Phone: 645-8816 Email: ccain2@buffalo.edu',\n", " 'Casper, Michael Office: 222\\xa0 Phone: 645-8779 Email: mjcasper@buffalo.edu',\n", " 'Chang, Hong Office: 136 Phone: 645-8821 Email: hchang24@buffalo.edu',\n", " 'Chen, Yen-Lin Office: 125 Phone: 645-8815 Email: yenlinch@buffalo.edu',\n", " 'Cheuk, Ka Yue Office: 140 Phone: 645-8825 Email: kayueche@buffalo.edu',\n", " 'Chou, Han-Jung Office:140 Phone: 645-8825 Email: hanjungc@buffalo.edu',\n", " 'Cosgrove, Gage (Makenzie) Office: 139 Phone: 645-8824 Email: gagecosg@buffalo.edu',\n", " 'Deutsch, Dustin Office: 140\\xa0 Phone: 645-8825 Email: dustinde@buffalo.edu',\n", " 'Dey, Subhankar Office: 140\\xa0 Phone: 645-8825 Email: subhanka@buffalo.edu',\n", " 'Doga, Hakan Office: 126\\xa0 Phone: 645-8816 Email: hakandog@buffalo.edu',\n", " 'Dong, Sai Office: 138 Phone: 645-8823 Email: saidong@buffalo.edu',\n", " 'Engelhardt, Carolyn Office: 132 Phone: 645-8820 Email: cengelha@buffalo.edu',\n", " 'Goswami, Abhishek Office: 125 Phone: 645-8815 Email: goswami3@buffalo.edu',\n", " 'Gu, Jiaxi Office: 132\\xa0 Phone: 645-8820\\xa0 Email: jiaxigu@buffalo.edu',\n", " 'Hopfensperger, John Office: 139\\xa0 Phone: 645-8824 Email: johnhopf@buffalo.edu',\n", " 'Hung, Tsz Fun Office: 137\\xa0 Phone: 645-8822 Email: tszfunhu@buffalo.edu',\n", " 'Huynh, Bao Office: 131 Phone: 645-8819 Email: baohuynh@buffalo.edu',\n", " 'Jeong, Myonggin Office: 132 Phone: 645-8820 Email: myonggin@buffalo.edu',\n", " 'Jin, Xifeng Office: 137\\xa0 Phone: 645-8822 Email: xifengji@buffalo.edu',\n", " 'Johnson, Megan Office: 132\\xa0 Phone: 645-8820 Email: meganjoh@buffalo.edu',\n", " 'Jones, Raymond Office: 140\\xa0 Phone: 645-8825 Email: rpjones2@buffalo.edu',\n", " 'Kilic, Bengier Ulgen Office: 125 Phone: 645-8815 Email: bengieru@buffalo.edu',\n", " 'Kim, Jiseong Office: 125 Phone: 645-8815 Email: jiseongk@buffalo.edu',\n", " 'Kireyev, Dmitri Office: 138\\xa0 Phone: 645-8823 Email: dmitriki@buffalo.edu',\n", " 'Le, Minh Quang Office: 131\\xa0 Phone: 645-8819 Email: minhquan@buffalo.edu',\n", " 'Lin, Wei Office: 313\\xa0 Phone: 645-8827 Email: wlin27@buffalo.edu',\n", " 'Liu, Ruodan Office: 140 Phone: 645-8825 Email: rliu8@buffalo.edu',\n", " 'Liu,Tianmou Office: 136 Phone: 645-8821 Email: tianmoul@buffalo.edu',\n", " 'Liu, Yuan Office: 140 Phone: 645-8825 Email: yuanliu@buffalo.edu',\n", " 'Lottes, Jonathan Office: 129 Phone: 645-8817 Email: jllottes@buffalo.edu',\n", " 'Ma, Ning Office: 125\\xa0 Phone: 645-8815 Email: nma22@buffalo.edu',\n", " 'Ma, Renda Office: 125\\xa0 Phone: 645-8815 Email: rendama@buffalo.edu',\n", " 'Ma, Yuqing Office: 138\\xa0 Phone: 645-8823\\xa0 Email: yuqingma@buffalo.edu',\n", " 'Mandava, Navya Office: 130\\xa0 Phone: 645-8818 Email: navyatejam@gmail.com',\n", " 'Meng, Lingqi Office: 130 Phone: 645-8818 Email: lingqime@buffalo.edu',\n", " 'Montoro, Michael Office: 126\\xa0 Phone: 645-8816 Email: mnmontor@buffalo.edu',\n", " 'Oregero, Jeffrey Office: 129\\xa0 Phone: 645-8817\\xa0 Email: jaoreger@buffalo.edu',\n", " 'Peng, Jun Office: 139 Phone: 645-8824 Email: jpeng3@buffalo.edu',\n", " 'Raico, Pasquale Office: 140 Phone: 645-8825 Email: ppraico@buffalo.edu',\n", " 'Rozwood, Bud Office: 125 Phone: 645-8815 Email: budrozwo@buffalo.edu',\n", " 'Sailbery, Mitchell Office: 126 Phone: 645-8816 Email: msailsbe@buffalo.edu',\n", " 'Samer, Andrew Office: 130\\xa0 Phone: 645-8818 Email: agsamer@buffalo.edu',\n", " 'Schilder, Marc Office: 313 Phone: 645-8827 Email: mbschild@buffalo.edu',\n", " 'Schmidt, Matthew Office: 129\\xa0\\xa0Phone: 645-8817\\xa0 Email: mwschmid@buffalo.edu',\n", " 'Shi, Ruonan Office: 139\\xa0 Phone: 645-8824 Email: ruonansh@buffalo.edu',\n", " 'Solanski, Deepisha Office: 140 Phone: 645-8825 Email: deepisha@buffalo.edu',\n", " 'Song, Zhao Office: 131 Phone: 645-8819 Email: zhaosong@buffalo.edu',\n", " 'Sullivan, Mark Office: 136\\xa0 Phone: 645-8821 Email: marksull@buffalo.edu',\n", " 'Sun, Yuxun Office: 140 Phone: 645-8825 Email: yuxunsun@buffalo.edu',\n", " 'Upadhyay, Aparna Office: 136\\xa0 Phone: 645-8821 Email: aparnaup@buffalo.edu',\n", " 'Wang, Biao Office: 125 Phone: 645-8815 Email: bwang32@buffalo.edu',\n", " 'Wang, Daxun Office: 141 Phone: 645-8825 Email: daxunwan@buffalo.edu',\n", " 'Wang, Shiruo Office: 125 Phone: 645-8815 Email: shiruo@buffalo.edu',\n", " 'Wang, Weiqi Office: 125 Phone: 645-8815 Email: weiqiwan@buffalo.edu',\n", " 'Winton, Daniel Office: 125 Phone: 645-8815 Email: dmwinton@buffalo.edu',\n", " 'Yuan, Cheng Office: 137\\xa0 Phone: 645-8822 Email: chengyua@buffalo.edu',\n", " 'Zalloum, Abdalrazzaq R.A. Personal Website Office: 140\\xa0 Phone: 645-8825\\xa0 Email: abdalraz@buffalo.edu',\n", " 'Zeng, Naijiang Office: 137\\xa0 Phone: 645-8822 Email: naijiang@buffalo.edu',\n", " 'Zhou, Chen Office: 137\\xa0 Phone: 645-8822 Email: czhou5@buffalo.edu',\n", " 'Ziegler, Cameron Office: 140\\xa0 Phone: 645-8825 Email: cz22@buffalo.edu']" ] }, "execution_count": 11, "metadata": {}, "output_type": "execute_result" } ], "source": [ "grad_list = [t.get_text() for t in ptags if \"Office\" in t.get_text()]\n", "grad_list" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "
\n", "go top\n", "\n", "

\n", "4. String Processing\n", "

\n", " \n", "
\n", "\n", "Here we get names and emails from the strings.\n", "\n", "
" ] }, { "cell_type": "code", "execution_count": 13, "metadata": {}, "outputs": [ { "data": { "text/plain": [ "'Deutsch, Dustin Office: 140\\xa0 Phone: 645-8825 Email: dustinde@buffalo.edu'" ] }, "execution_count": 13, "metadata": {}, "output_type": "execute_result" } ], "source": [ "dustin = grad_list[10]\n", "dustin" ] }, { "cell_type": "code", "execution_count": 19, "metadata": {}, "outputs": [ { "data": { "text/plain": [ "17" ] }, "execution_count": 19, "metadata": {}, "output_type": "execute_result" } ], "source": [ "n = dustin.find(\"Office\")\n", "n" ] }, { "cell_type": "code", "execution_count": 21, "metadata": {}, "outputs": [ { "data": { "text/plain": [ "'Deutsch, Dustin'" ] }, "execution_count": 21, "metadata": {}, "output_type": "execute_result" } ], "source": [ "dustin[0:n].strip()" ] }, { "cell_type": "code", "execution_count": 16, "metadata": {}, "outputs": [], "source": [ "def get_name(s):\n", " stop = s.find(\"Office:\")\n", " return s[:stop].strip()" ] }, { "cell_type": "code", "execution_count": 17, "metadata": {}, "outputs": [ { "data": { "text/plain": [ "'Deutsch, Dustin'" ] }, "execution_count": 17, "metadata": {}, "output_type": "execute_result" } ], "source": [ "get_name(dustin)" ] }, { "cell_type": "code", "execution_count": 18, "metadata": {}, "outputs": [ { "data": { "text/plain": [ "'dustinde@buffalo.edu'" ] }, "execution_count": 18, "metadata": {}, "output_type": "execute_result" } ], "source": [ "def get_email(s):\n", " start = s.find(\"Email:\") + 6\n", " return s[start:].strip()\n", "\n", "get_email(dustin)" ] }, { "cell_type": "code", "execution_count": 19, "metadata": { "scrolled": false }, "outputs": [ { "name": "stdout", "output_type": "stream", "text": [ "Name: Abeya Ranasinghe Mudiyanselage, Asela V.\n", "Email: aselavir@buffalo.edu\n", "\n", "Name: Alegria, Linda\n", "Email: lindaale@buffalo.edu\n", "\n", "Name: Bittner, Alyson\n", "Email: alysonbi@buffalo.edu\n", "\n", "Name: Cain Charles\n", "Email: ccain2@buffalo.edu\n", "\n", "Name: Casper, Michael\n", "Email: mjcasper@buffalo.edu\n", "\n", "Name: Chang, Hong\n", "Email: hchang24@buffalo.edu\n", "\n", "Name: Chen, Yen-Lin\n", "Email: yenlinch@buffalo.edu\n", "\n", "Name: Cheuk, Ka Yue\n", "Email: kayueche@buffalo.edu\n", "\n", "Name: Chou, Han-Jung\n", "Email: hanjungc@buffalo.edu\n", "\n", "Name: Cosgrove, Gage (Makenzie)\n", "Email: gagecosg@buffalo.edu\n", "\n", "Name: Deutsch, Dustin\n", "Email: dustinde@buffalo.edu\n", "\n", "Name: Dey, Subhankar\n", "Email: subhanka@buffalo.edu\n", "\n", "Name: Doga, Hakan\n", "Email: hakandog@buffalo.edu\n", "\n", "Name: Dong, Sai\n", "Email: saidong@buffalo.edu\n", "\n", "Name: Engelhardt, Carolyn\n", "Email: cengelha@buffalo.edu\n", "\n", "Name: Goswami, Abhishek\n", "Email: goswami3@buffalo.edu\n", "\n", "Name: Gu, Jiaxi\n", "Email: jiaxigu@buffalo.edu\n", "\n", "Name: Hopfensperger, John\n", "Email: johnhopf@buffalo.edu\n", "\n", "Name: Hung, Tsz Fun\n", "Email: tszfunhu@buffalo.edu\n", "\n", "Name: Huynh, Bao\n", "Email: baohuynh@buffalo.edu\n", "\n", "Name: Jeong, Myonggin\n", "Email: myonggin@buffalo.edu\n", "\n", "Name: Jin, Xifeng\n", "Email: xifengji@buffalo.edu\n", "\n", "Name: Johnson, Megan\n", "Email: meganjoh@buffalo.edu\n", "\n", "Name: Jones, Raymond\n", "Email: rpjones2@buffalo.edu\n", "\n", "Name: Kilic, Bengier Ulgen\n", "Email: bengieru@buffalo.edu\n", "\n", "Name: Kim, Jiseong\n", "Email: jiseongk@buffalo.edu\n", "\n", "Name: Kireyev, Dmitri\n", "Email: dmitriki@buffalo.edu\n", "\n", "Name: Le, Minh Quang\n", "Email: minhquan@buffalo.edu\n", "\n", "Name: Lin, Wei\n", "Email: wlin27@buffalo.edu\n", "\n", "Name: Liu, Ruodan\n", "Email: rliu8@buffalo.edu\n", "\n", "Name: Liu,Tianmou\n", "Email: tianmoul@buffalo.edu\n", "\n", "Name: Liu, Yuan\n", "Email: yuanliu@buffalo.edu\n", "\n", "Name: Lottes, Jonathan\n", "Email: jllottes@buffalo.edu\n", "\n", "Name: Ma, Ning\n", "Email: nma22@buffalo.edu\n", "\n", "Name: Ma, Renda\n", "Email: rendama@buffalo.edu\n", "\n", "Name: Ma, Yuqing\n", "Email: yuqingma@buffalo.edu\n", "\n", "Name: Mandava, Navya\n", "Email: navyatejam@gmail.com\n", "\n", "Name: Meng, Lingqi\n", "Email: lingqime@buffalo.edu\n", "\n", "Name: Montoro, Michael\n", "Email: mnmontor@buffalo.edu\n", "\n", "Name: Oregero, Jeffrey\n", "Email: jaoreger@buffalo.edu\n", "\n", "Name: Peng, Jun\n", "Email: jpeng3@buffalo.edu\n", "\n", "Name: Raico, Pasquale\n", "Email: ppraico@buffalo.edu\n", "\n", "Name: Rozwood, Bud\n", "Email: budrozwo@buffalo.edu\n", "\n", "Name: Sailbery, Mitchell\n", "Email: msailsbe@buffalo.edu\n", "\n", "Name: Samer, Andrew\n", "Email: agsamer@buffalo.edu\n", "\n", "Name: Schilder, Marc\n", "Email: mbschild@buffalo.edu\n", "\n", "Name: Schmidt, Matthew\n", "Email: mwschmid@buffalo.edu\n", "\n", "Name: Shi, Ruonan\n", "Email: ruonansh@buffalo.edu\n", "\n", "Name: Solanski, Deepisha\n", "Email: deepisha@buffalo.edu\n", "\n", "Name: Song, Zhao\n", "Email: zhaosong@buffalo.edu\n", "\n", "Name: Sullivan, Mark\n", "Email: marksull@buffalo.edu\n", "\n", "Name: Sun, Yuxun\n", "Email: yuxunsun@buffalo.edu\n", "\n", "Name: Upadhyay, Aparna\n", "Email: aparnaup@buffalo.edu\n", "\n", "Name: Wang, Biao\n", "Email: bwang32@buffalo.edu\n", "\n", "Name: Wang, Daxun\n", "Email: daxunwan@buffalo.edu\n", "\n", "Name: Wang, Shiruo\n", "Email: shiruo@buffalo.edu\n", "\n", "Name: Wang, Weiqi\n", "Email: weiqiwan@buffalo.edu\n", "\n", "Name: Winton, Daniel\n", "Email: dmwinton@buffalo.edu\n", "\n", "Name: Yuan, Cheng\n", "Email: chengyua@buffalo.edu\n", "\n", "Name: Zalloum, Abdalrazzaq R.A. Personal Website\n", "Email: abdalraz@buffalo.edu\n", "\n", "Name: Zeng, Naijiang\n", "Email: naijiang@buffalo.edu\n", "\n", "Name: Zhou, Chen\n", "Email: czhou5@buffalo.edu\n", "\n", "Name: Ziegler, Cameron\n", "Email: cz22@buffalo.edu\n", "\n" ] } ], "source": [ "for s in grad_list:\n", " name = get_name(s)\n", " email = get_email(s)\n", " print(f\"Name: {name}\")\n", " print(f\"Email: {email}\")\n", " print()" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "
\n", "

\n", " Part 3: Numpy  \n", " \n", " \n", "

\n", "
" ] } ], "metadata": { "kernelspec": { "display_name": "Python 3", "language": "python", "name": "python3" }, "language_info": { "codemirror_mode": { "name": "ipython", "version": 3 }, "file_extension": ".py", "mimetype": "text/x-python", "name": "python", "nbconvert_exporter": "python", "pygments_lexer": "ipython3", "version": "3.7.1" } }, "nbformat": 4, "nbformat_minor": 2 }