Skip to content

Commit 27a4c94

Browse files
committed
new pandas resource
1 parent 8231625 commit 27a4c94

File tree

1 file changed

+4
-0
lines changed

1 file changed

+4
-0
lines changed

content/pages/03-data/16-pandas.markdown

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -114,3 +114,7 @@ is a data structures and analysis library.
114114
explains that pandas operations can often be parallelized for better
115115
performance using the Pandas UDFs feature in PySpark version 2.3
116116
or greater.
117+
118+
* [How to use Pandas read_html to Scrape Data from HTML Tables](https://www.marsja.se/how-to-use-pandas-read_html-to-scrape-data-from-html-tables/)
119+
has a bunch of great code examples that show how to load
120+
data from HTML directly into your DataFrames.

0 commit comments

Comments
 (0)