Scaling up Machine Learning: Parallel and Distributed Approaches

doi:10.1017/CBO9781139042918

Scaling up Machine Learning

Parallel and Distributed Approaches

- Get access
  
  Buy a print copy
  
  Check if you have access via personal or institutional login
  
  Log in Register
Cited by 158
Cited by
- 158
Crossref Citations

This Book has been cited by the following publications. This list is generated based on data provided by Crossref.

Tsianos, Konstantinos I. Lawlor, Sean and Rabbat, Michael G. 2012. Consensus-based distributed optimization: Practical issues and applications in large-scale machine learning. p. 1543.

CrossRef

Google Scholar

STRUHARIK, RASTISLAV J. R. and NOVAK, LADISLAV A. 2013. HARDWARE IMPLEMENTATION OF DECISION TREE ENSEMBLES. Journal of Circuits, Systems and Computers, Vol. 22, Issue. 05, p. 1350032.

CrossRef

Google Scholar

Zheng, Lu and Mengshoel, Ole 2013. Optimizing parallel belief propagation in junction treesusing regression. p. 757.

CrossRef

Google Scholar

Chrysos, Grigorios Dagritzikos, Panagiotis Papaefstathiou, Ioannis and Dollas, Apostolos 2013. HC-CART. ACM Transactions on Architecture and Code Optimization, Vol. 9, Issue. 4, p. 1.

CrossRef

Google Scholar

Lee, Soomin and Nedic, Angelia 2013. Distributed mini-batch random projection algorithms for reduced communication overhead. p. 559.

CrossRef

Google Scholar

Tsianos, Konstantinos I. Lawlor, Sean F. Yu, Jun Ye and Rabbat, Michael G. 2013. Networked optimization with adaptive communication. p. 579.

CrossRef

Google Scholar

McMahan, H. Brendan Holt, Gary Sculley, D. Young, Michael Ebner, Dietmar Grady, Julian Nie, Lan Phillips, Todd Davydov, Eugene Golovin, Daniel Chikkerur, Sharat Liu, Dan Wattenberg, Martin Hrafnkelsson, Arnar Mar Boulos, Tom and Kubica, Jeremy 2013. Ad click prediction. p. 1222.

CrossRef

Google Scholar

HEGEDŰS, ISTVÁN ORMÁNDI, RÓBERT and JELASITY, MÁRK 2013. MASSIVELY DISTRIBUTED CONCEPT DRIFT HANDLING IN LARGE NETWORKS. Advances in Complex Systems, Vol. 16, Issue. 04n05, p. 1350021.

CrossRef

Google Scholar

Ngufor, Che and Wojtusiak, Janusz 2014. Learning from Large Distributed Data: A Scaling Down Sampling Scheme for Efficient Data Processing. International Journal of Machine Learning and Computing, Vol. 4, Issue. 3, p. 216.

CrossRef

Google Scholar

Tsianos, Konstantinos I. Sarwate, Anand D. and Rabbat, Michael G. 2014. Tradeoffs for task parallelization in distributed optimization. p. 1.

CrossRef

Google Scholar

Clemencon, Stephan Bertail, Patrice and Chautru, Emilie 2014. Scaling up M-estimation via sampling designs: The Horvitz-Thompson stochastic gradient descent. p. 25.

CrossRef

Google Scholar

Miller, Lisa J. Gazan, Rich and Still, Susanne 2014. Unsupervised classification and visualization of unstructured text for the support of interdisciplinary collaboration. p. 1033.

CrossRef

Google Scholar

Struharik, R. 2015. IP cores for hardware acceleration of decision tree ensemble classifiers. p. 45.

CrossRef

Google Scholar

Mokhtari, Aryan and Ribeiro, Alejandro 2015. Decentralized double stochastic averaging gradient. p. 406.

CrossRef

Google Scholar

Mokhtari, Aryan Shi, Wei Ling, Qing and Ribeiro, Alejandro 2015. Decentralized quadratically approximated alternating direction method of multipliers. p. 795.

CrossRef

Google Scholar

Landset, Sara Khoshgoftaar, Taghi M. Richter, Aaron N. and Hasanin, Tawfiq 2015. A survey of open source tools for machine learning with big data in the Hadoop ecosystem. Journal of Big Data, Vol. 2, Issue. 1,

CrossRef

Google Scholar

Kourid, Ahlem and Batouche, Mohamed 2015. A novel approach for feature selection based on MapReduce for biomarker discovery. p. 1.

CrossRef

Google Scholar

Yu, Chung-Kai van der Schaar, Mihaela and Sayed, Ali H. 2015. Information-Sharing Over Adaptive Networks With Self-Interested Agents. IEEE Transactions on Signal and Information Processing over Networks, Vol. 1, Issue. 1, p. 2.

CrossRef

Google Scholar

Struharik, R. 2015. Decision tree ensemble hardware accelerators for embedded applications. p. 101.

CrossRef

Google Scholar

Ure, N. Kemal Omidshafiei, Shayegan Lopez, Brett Thomas Agha-Mohammadi, Ali-akbar How, Jonathan P. and Vian, John 2015. Online heterogeneous multiagent learning under limited communication with applications to forest fire management. p. 5181.

CrossRef

Google Scholar

Download full list

Edited by Ron Bekkerman, LinkedIn Corporation, Mountain View, California, Mikhail Bilenko, Microsoft Research, Redmond, Washington, John Langford, Yahoo! Research, New York

Publisher:: Cambridge University Press
Online publication date:: February 2012
Print publication year:: 2011
Online ISBN:: 9781139042918
DOI:: https://doi.org/10.1017/CBO9781139042918

Subjects:: Computer Science, Pattern Recognition and Machine Learning, Distributed, Networked and Mobile Computing

60.99 (USD)

Digital access for individuals
(PDF download and/or read online)
Add to cart

Added to cart

Digital access for individuals
(PDF download and/or read online)
View cart
Export citation
Buy a print copy

Information

Contents

Metrics

This book presents an integrated collection of representative approaches for scaling up machine learning and data mining methods on parallel and distributed computing platforms. Demand for parallelizing learning algorithms is highly task-specific: in some settings it is driven by the enormous dataset sizes, in others by model complexity or by real-time performance requirements. Making task-appropriate algorithm and platform choices for large-scale machine learning requires understanding the benefits, trade-offs and constraints of the available options. Solutions presented in the book cover a range of parallelization platforms from FPGAs and GPUs to multi-core systems and commodity clusters, concurrent programming frameworks including CUDA, MPI, MapReduce and DryadLINQ, and learning settings (supervised, unsupervised, semi-supervised and online learning). Extensive coverage of parallelization of boosted trees, SVMs, spectral clustering, belief propagation and other popular learning algorithms, and deep dives into several applications, make the book equally useful for researchers, students and practitioners.

‘One of the landmark achievements of our time is the ability to extract value from large volumes of data. Engineering and algorithmic developments on this front have gelled substantially in recent years, and are quickly being reduced to practice in widely available, reusable forms. This book provides a broad and timely snapshot of the state of developments in scalable machine learning, which should be of interest to anyone who wishes to understand and extend the state of the art in analyzing data.’

Joseph M. Hellerstein - University of California, Berkeley

‘This is a book that every machine learning practitioner should keep in their library.’

Yoram Singer - Google Inc.

‘The contributions in this book run the gamut from frameworks for large-scale learning to parallel algorithms to applications, and contributors include many of the top people in this burgeoning subfield. Overall this book is an invaluable resource for anyone interested in the problem of learning from and working with big datasets.’

William W. Cohen - Carnegie Mellon University, Pennsylvania

‘This unique, timely book provides a 360 degrees view and understanding of both conceptual and practical issues that arise when implementing leading machine learning algorithms on a wide range of parallel and high-performance computing platforms. It will serve as an indispensable handbook for the practitioner of large-scale data analytics and a guide to dealing with BIG data and making sound choices for efficient applying learning algorithms to them. It can also serve as the basis for an attractive graduate course on parallel/distributed machine learning and data mining.’

Joydeep Ghosh - University of Texas

- Aa Reduce text
- Aa Enlarge text

View selected items
Save to my bookmarks
Export citations
Download PDF (zip)
Save to Kindle
Save to Dropbox
Save to Google Drive
Save content to
To save content items to your account, please confirm that you agree to abide by our usage policies. If this is the first time you use this feature, you will be asked to authorise Cambridge Core to connect with your account. Find out more about saving content to .

To save content items to your Kindle, first ensure no-reply@cambridge.org is added to your Approved Personal Document E-mail List under your Personal Document Settings on the Manage Your Content and Devices page of your Amazon account. Then enter the ‘name’ part of your Kindle email address below. Find out more about saving to your Kindle.

Note you can select to save to either the @free.kindle.com or @kindle.com variations. ‘@free.kindle.com’ emails are free but can only be saved to your device when it is connected to wi-fi. ‘@kindle.com’ emails can be delivered even when you are not connected to wi-fi, but note that service fees apply.

Find out more about the Kindle Personal Document Service.

Please be advised that item(s) you selected are not available.
You are about to save
Your Kindle email address

Please provide your Kindle email.

@free.kindle.com @kindle.com (service fees apply)

By using this service, you agree that you will only keep content for personal use, and will not openly distribute them via Dropbox, Google Drive or other file sharing services

Metrics

Altmetric attention score

Total number of HTML views: 0

Total number of PDF views: 0 *

Loading metrics...

Total views: 0 *

Loading metrics...

* Views captured on Cambridge Core between #date#. This data will be updated every 24 hours.

Usage data cannot currently be displayed.

Scaling up Machine Learning

Parallel and Distributed Approaches

Book description

Reviews

Refine List

Actions for selected content:

Save Search

Contents

Metrics

Altmetric attention score

Full text views

Book summary page views