Principles and Theory for Data Mining and Machine Learning [Clarke, Fokoué & Zhang 2009-07-30](1).pdf
(
13459 KB
)
Pobierz
Springer Series in Statistics
Advisors:
P. Bickel, P. Diggle, S. Fienberg, U. Gather,
I. Olkin, S. Zeger
For other titles published in this series go to,
http://www.springer.com/series/692
Bertrand Clarke
·
Ernest Fokou´
·
Hao Helen Zhang
e
Principles and Theory
for Data Mining
and Machine Learning
123
Bertrand Clarke
University of Miami
120 NW 14th Street
CRB 1055 (C-213)
Miami, FL, 33136
bclarke2@med.miami.edu
Hao Helen Zhang
Department of Statistics
North Carolina State University
Genetics
P.O.Box 8203
Raleigh, NC 27695-8203
USA
hzhang2@stat.ncsu.edu
Ernest Fokou´
e
Center for Quality and Applied Statistics
Rochester Institute of Technology
98 Lomb Memorial Drive
Rochester, NY 14623
ernest.fokoue@gmail.com
ISSN 0172-7397
ISBN 978-0-387-98134-5
e-ISBN 978-0-387-98135-2
DOI 10.1007/978-0-387-98135-2
Springer Dordrecht Heidelberg London New York
Library of Congress Control Number: 2009930499
c Springer Science+Business Media, LLC 2009
All rights reserved. This work may not be translated or copied in whole or in part without the written
permission of the publisher (Springer Science+Business Media, LLC, 233 Spring Street, New York,
NY 10013, USA), except for brief excerpts in connection with reviews or scholarly analysis. Use in
connection with any form of information storage and retrieval, electronic adaptation, computer
software, or by similar or dissimilar methodology now known or hereafter developed is forbidden.
The use in this publication of trade names, trademarks, service marks, and similar terms, even if
they are not identified as such, is not to be taken as an expression of opinion as to whether or not
they are subject to proprietary rights.
Printed on acid-free paper
Springer is part of Springer Science+Business Media (www.springer.com)
Preface
The idea for this book came from the time the authors spent at the Statistics and
Applied Mathematical Sciences Institute (SAMSI) in Research Triangle Park in North
Carolina starting in fall 2003. The first author was there for a total of two years, the
first year as a Duke/SAMSI Research Fellow. The second author was there for a year
as a Post-Doctoral Scholar. The third author has the great fortune to be in RTP per-
manently. SAMSI was – and remains – an incredibly rich intellectual environment
with a general atmosphere of free-wheeling inquiry that cuts across established fields.
SAMSI encourages creativity: It is the kind of place where researchers can be found at
work in the small hours of the morning – computing, interpreting computations, and
developing methodology. Visiting SAMSI is a unique and wonderful experience.
The people most responsible for making SAMSI the great success it is include Jim
Berger, Alan Karr, and Steve Marron. We would also like to express our gratitude to
Dalene Stangl and all the others from Duke, UNC-Chapel Hill, and NC State, as well
as to the visitors (short and long term) who were involved in the SAMSI programs. It
was a magical time we remember with ongoing appreciation.
While we were there, we participated most in two groups: Data Mining and Machine
Learning, for which Clarke was the group leader, and a General Methods group run
by David Banks. We thank David for being a continual source of enthusiasm and
inspiration. The first chapter of this book is based on the outline of the first part of
his short course on Data Mining and Machine Learning. Moreover, David graciously
contributed many of his figures to us. Specifically, we gratefully acknowledge that
Figs. 1.1–6, Figs. 2.1,3,4,5,7, Fig. 4.2, Figs. 8.3,6, and Figs. 9.1,2 were either done by
him or prepared under his guidance.
On the other side of the pond, the Newton Institute at Cambridge University provided
invaluable support and stimulation to Clarke when he visited for three months in 2008.
While there, he completed the final versions of Chapters 8 and 9. Like SAMSI, the
Newton Institute was an amazing, wonderful, and intense experience.
This work was also partially supported by Clarke’s NSERC Operating Grant
2004–2008. In the USA, Zhang’s research has been supported over the years by two
v
Plik z chomika:
musli_com
Inne pliki z tego folderu:
21 Recipes for Mining Twitter_ Distilling Rich Information from Messy Data [Russell 2011-03-10](1).pdf
(1049 KB)
Active Mining_ New Directions of Data Mining [Motoda 2002-07-29](2).pdf
(8618 KB)
Advanced Data Mining Techniques [Olson & Delen 2008-01-21](1).pdf
(1098 KB)
Advances in Data Mining_ Knowledge Discovery and Applications [Karahoca 2014](2).pdf
(15624 KB)
Advances in K-means Clustering_ A Data Mining Thinking [Wu 2012-07-10](1).pdf
(2511 KB)
Inne foldery tego chomika:
cheat-sheets
Data Structures
Demystified Series
Dreamweaver
Eclipse
Zgłoś jeśli
naruszono regulamin