Data Science from Scratch- First Principles with Python.pdf

Data Science from Scratch
FIRST PRINCIPLES WITH PYTHON
Joel Grus
www.allitebooks.com
Data Science from Scratch
Data science libraries, frameworks, modules, and toolkits are great for
doing data science, but they’re also a good way to dive into the discipline
without actually understanding data science. In this book, you’ll learn how
many of the most fundamental data science tools and algorithms work by
implementing them from scratch.
If you have an aptitude for mathematics and some programming skills,
author Joel Grus will help you get comfortable with the math and statistics
at the core of data science, and with the hacking skills you need to get started
as a data scientist. Today’s messy glut of data holds answers to questions
no one’s even thought to ask. This book provides you with the know-how
to dig those answers out.
Joel takes you on a journey from being data-curious to getting a thorough
understanding of the bread-and-butter algorithms that every data scientist
should know.
—Rohit Sivaprasad
Data Science, Soylent
datatau.com
Get a crash course in Python
Learn the basics of linear algebra, statistics, and probability, and understand how and when they're used in data science
Collect, explore, clean, munge, and manipulate data
Dive into the fundamentals of machine learning
Implement models such as k-nearest neighbors, Naive Bayes, linear and logistic regression, decision trees, neural networks, and clustering
Explore recommender systems, natural language processing, network analysis, MapReduce, and databases
Joel Grus is a software engineer at Google. Before that, he worked as a data
scientist at multiple startups. He lives in Seattle, where he regularly attends data
science happy hours. He blogs infrequently at joelgrus.com and tweets all day
long at @joelgrus.
DATA / DATA SCIENCE
Twitter: @oreillymedia
facebook.com/oreilly
CAN $45.99
US $39.99
ISBN: 978-1-491-90142-7
Data Science from Scratch
by Joel Grus
Copyright © 2015 O’Reilly Media. All rights reserved.
Printed in the United States of America.
Published by O’Reilly Media, Inc., 1005 Gravenstein Highway North, Sebastopol, CA 95472.
O’Reilly books may be purchased for educational, business, or sales promotional use. Online editions are
also available for most titles (http://safaribooksonline.com). For more information, contact our
corporate/institutional sales department: 800-998-9938 or corporate@oreilly.com.
Editor: Marie Beaugureau
Production Editor: Melanie Yarbrough
Copyeditor: Nan Reinhardt
Proofreader: Eileen Cohen
Indexer: Ellen Troutman-Zaig
Interior Designer: David Futato
Cover Designer: Karen Montgomery
Illustrator: Rebecca Demarest
April 2015: First Edition
Revision History for the First Edition
2015-04-10: First Release
See http://oreilly.com/catalog/errata.csp?isbn=9781491901427 for release details.
The O’Reilly logo is a registered trademark of O’Reilly Media, Inc.
Data Science from Scratch, the cover image of a Rock Ptarmigan, and related trade dress are
trademarks of O’Reilly Media, Inc.
While the publisher and the author have used good faith efforts to ensure that the information and
instructions contained in this work are accurate, the publisher and the author disclaim all responsibility
for errors or omissions, including without limitation responsibility for damages resulting from the use of
or reliance on this work. Use of the information and instructions contained in this work is at your own
risk. If any code samples or other technology this work contains or describes is subject to open source
licenses or the intellectual property rights of others, it is your responsibility to ensure that your use
thereof complies with such licenses and/or rights.
978-1-491-90142-7
[LSI]
Table of Contents
Preface
1. Introduction
    The Ascendance of Data
    What Is Data Science?
    Motivating Hypothetical: DataSciencester
    Finding Key Connectors
    Data Scientists You May Know
    Salaries and Experience
    Paid Accounts
    Topics of Interest
    Onward
2. A Crash Course in Python
    The Basics
    Getting Python
    The Zen of Python
    Whitespace Formatting
    Modules
    Arithmetic
    Functions
    Strings
    Exceptions
    Lists
    Tuples
    Dictionaries
    Sets
    Control Flow