- Jin Chang, Jun Luo, and Joshua Zhexue Huang, "Minimum spanning tree based classification model for massive data with MapReduce implementation"
- Claudia Plant and Christian Böhm, "Parallel EM-Clustering: Fast Convergence by Asynchronous Model Updates"
- Huizhi Liang, "Parallel User profiling based on folksonomy for Large Scaled Recommender Systems-An implimentation of Cascading MapReduce"
- Venkatram Ramanathan, Wenjing Ma, Vignesh Ravi, and Gagan Agrawal, "Parallelizing Information Theoretic Co-clustering Algorithm Using a Cloud Middleware"
- James Horey, "Challenges in Scheduling Aggregation in CyberPhysical Information Processing Systems"
- Leonardo Neumeyer, Bruce Robbins, Anish Nair, and Anandsudhakar Kesari, "S4: Distributed Stream Computing Platform"
- Li Lu, Yunhong Gu, and Robert Grossman, "dSimpleGraph: a Novel Distributed Clustering Algorithm for Exploring Very Large Scale Unknown Data Sets"
- Andrea Campagna and Rasmus Pagh, "On Finding Similar Items in a Stream of Transactions"
- Srivatsava Daruru, Sankari Dhandapani, Gunjan Gupta, Ilian Iliev, Weijia Xu, Paul Navratil, Nena Marin, and Joydeep Ghosh, "Distributed, Scalable Clustering for Detecting Halos in Terascale Astronomy"