CSci 8363 -- Fall 2008 -- List of Papers
Tentative early list of papers to be presented during the semester.
General
x Linear Algebra in Data Exploration - Examples
01 09/02 Intro - Examples. "bio.pdf" - Examples of Linear Algebra in data Mining (Boley)
02 09/04 Review of linear algebra "intro.pdf" - Intriductory/Review slides
Basics of Eigenvalues, PCA definition
x A tutorial on Principal Components Analysis
Lindsey I Smith
http://www.cs.otago.ac.nz/cosc453/student_tutorials/principal_components.pdf
03 09/09 PCA + Elem Stat Boley
x A Tutorial on Principal Component Analysis
Jonathon Shlens
http://www.snl.salk.edu/~shlens/pub/notes/pca.pdf
(also "http://www.dgp.toronto.edu/~aranjan/tuts/pca.pdf")
04 09/11 PCA by example, PCA<->SVD "PCA.pdf ", by Boley
Information retrieval
x Computational Methods for Intelligent Information Access. M.W.
Berry, S.T. Dumais, and T.A. Letsche.Proceedings of Supercomputing'95,
San Diego, CA, December 1995.
http://citeseer.ist.psu.edu/438263.html
http://hpc.isti.cnr.it/~palmeri/datam/articles/SC95.ps
05 09/16 Latent Semantic Indexing slides by Vijay Kumar Adhikari
x Latent semantic indexing via a semi-discrete matrix decomposition
Tamara G. Kolda and Dianne P, O'Leary, in The Mathematics
of Information Coding, Extraction and Distribution, G. Cybenko et al.,
eds., vol. 107 of IMA Volumes in Mathematics and Its Applications.
Springer-Verlag, 1999, pp. 73-80.
http://portal.acm.org/citation.cfm?id=291131
06 09/18 Semi-discrete matrix decomp. slides by Joshua Jorensby
Clustering using vector-based similarities
x Outlier Detection Using SemiDiscrete Decomposition (2002)
S. McConnell, D. Skillicorn
http://citeseer.ist.psu.edu/mcconnell02outlier.html
07 09/23 SDD outlier application slides by Bryan Decaire
Eigenfaces (part I)
x (Probabilistic Visual Learning for Object Representation, Baback
Moghaddam, Alex Pentland Early Visual Learning, Oxford University
Press, 1996.
http://citeseer.ist.psu.edu/moghaddam96probabilistic.html
08 09/25 Eigenfaces slides by Ravishankar Sivalingam
Clustering using vector-based similarities (continued)
x Concept Decompositions for Large Sparse Text Data using Clustering.
I.S. Dhillon, D.S. Modha, IBM Research Report RJ 10147, July 8, 1999,
Machine Learning, 42:1, pages 143-175, January 2001.
http://www.cs.utexas.edu/users/inderjit/public_papers/concept_mlj.pdf
09 09/30 Concept Decompositions slides by Michael Karp
x Concept Decomposition Using Clustering
I.S. Dhillon, D.S. Modha
Patent 6560597
http://www.google.com/patents?id=ghQPAAAAEBAJ&dq=6560597.
10 10/07 Concept Decomp: Patent slides by Boley
x Streaming Data Reduction Using Low-Memory Factored Representations.
David Littau and Daniel Boley.
Journal of Information Sciences, 176(14):2016-2041, Elsevier, 2006.
http://www-users.cs.umn.edu/~boley/publications/papers/Streaming05.pdf
?? ??/?? Application: streaming data _____EMPTY____________________
xx 10/02 on travel
x Video Google: A Text Retrieval Approach to Object Matching in Videos
Sivic, J. and Zisserman, A.
Proceedings of the International Conference on Computer Vision (2003)
http://www.robots.ox.ac.uk/~vgg/publications/papers/sivic03.pdf
Demo: http://www.robots.ox.ac.uk/~vgg/research/vgoogle/index.html
Slides: https://www.ipam.ucla.edu/publications/sews2/sews2_7272.pdf
10 10/09 Application: images slides by Weikang Qian
x Fast Monte Carlo Algorithms for Matrices I: Approximating Matrix Multiplication
Fast Monte Carlo Algorithms for Matrices II: Computing a Low Rank Approximation to a Matrix
P. Drineas, R. Kannan, and M.W. Mahoney
SIAM J Computing 2005
I: http://www.cs.rpi.edu/~drinep/matrixI_SICOMP.pdf
II: http://www.cs.rpi.edu/~drinep/matrixII_SICOMP.pdf
Related Slides: http://www.cs.rpi.edu/~drinep/SDMtutorial.ppt
11 10/14 Randomized Matrix Approx slides by Dong Jiao
(slides accessible only locally from 'umn.edu').
SVD/PCA in Biology -- Microarray analysis. (choose one)
x Singular value decomposition for genome-wide expression data processing and modeling
Orly Alter, Patrick O. Brown, David Botstein
Proc Natl Acad Sci U S A. 2000 August 29; 97(18): 10101-10106.
http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=27718
x Example of how SVD/PCA are used to analyse data - using example from Biology
Singular value decomposition and principal component analysis.
Michael E. Wall
http://public.lanl.gov/mewall/kluwer2002.html
12 10/16 PCA in Biology slides by Gyan Ranjan.
PDF version of slides.
x (Generalized singular value decomposition for comparative analysis of genome-scale expression data sets of two different organisms
Orly Alter, Patrick O. Brown, and David Botstein
PNAS, March 18, 2003, vol. 100, no. 6, pp 3351-3356
http://www.pnas.org/cgi/content/abstract/100/6/3351)
Linear discrinimant Analsys (LDA)
x Fisher Linear Discriminant Analysis
Max Wellman
http://www.cs.huji.ac.il/~csip/Fisher-LDA.pdf
13 10/21 Basic LDA shortintro slides by Zhonghua Jiang.
x Developmental Stage Annotation of Drosophila Gene Expression Pattern Images via an Entire Solution Path for LDA
Jieping Ye, Jianhui Chen, Ravi Janardan, Sudhir Kumar
http://doi.acm.org/10.1145/1342320.1342324
14 10/23 LDA example use slides by Jianxin Fang.
Multidimensional Scaling (one paper to be chosen)
x MULTIDIMENSIONAL SCALING
Forrest W. Young
http://forrest.psych.unc.edu/teaching/p208a/mds/mds.html
x Multidimensional Scaling
Stephen P. Borgatti
http://www.analytictech.com/borgatti/mds.htm
15 10/28 MDS final slides by Anoop Cherian.
ISOMAP, Local Linear Embedding and visualization (one paper to be chosen)
x An Introduction to Locally Linear Embedding. Lawrence Saul & Sam
Roweis. [draft version (Jan.01)]
http://www.cs.toronto.edu/%7Eroweis/lle/papers/lleintro.pdf
(also "http://www.cs.toronto.edu/~roweis/lle/publications.html")
16 10/30 Isomap + Locally Linear Embedding slides by Thanh Ngo.
Non-Negative Matrix Factorization
x Sparse Non-negative Matrix Factorizations via Alternating
Non-negativity-constrained Least Squares for Microarray Data Analysis,
H Kim and H Park
Bioinformatics, 23-12:1495-1502, 2007.
http://www.cc.gatech.edu/~hpark/papers/kp07snmf.pdf
17 11/04 NMF in micro array slides by John Joseph.
Link Analysis -- PageRank. HITS
x The PageRank Citation Ranking: Bringing Order to the Web.Page,
Lawrence; Brin, Sergey; Motwani, Rajeev; Winograd, Terry. Stanford
Univ. Computer Science Dept technical report. Oct. 2001
http://citeseer.ist.psu.edu/page98pagerank.html
18 11/06 PageRank idea slides by Shuo Guo.
x Deeper Inside PageRank
Amy N. Langville and Carl D. Meyer
Internet Mathematics Vol. 1, No. 3: 335-380
http://projecteuclid.org/DPubS?service=UI&version=1.0&verb=Display&handle=euclid.im/1109190965
http://projecteuclid.org/euclid.im/1109190965
19 11/11 Inside Pagerank revised slides by Shanzhen Chen.
x Authoritative Sources in a Hyperlinked Environment
Jon M. Kleinberg
Proc. 9th ACM-SIAM Symposium on Discrete Algorithms, 1998.
http://www.cs.cornell.edu/home/kleinber/auth.pdf
20 11/13 HITS revised slides by Gang Fang.
Eigenfaces
x (Probabilistic Visual Learning for Object Representation, Baback
Moghaddam, Alex Pentland Early Visual Learning, Oxford University
Press, 1996.
http://citeseer.ist.psu.edu/moghaddam96probabilistic.html
08 09/25 Eigenfaces _____________Ravishankar Sivalingam___
x Eigenfaces vs. Fisherfaces: Recognition Using Class Specific Linear Projection.
P. Belhumeur, J. Hespanha, and D. Kriegman (july 1997).
IEEE Transactions on pattern analysis and machine intelligence 19 (7).
http://dx.doi.org/10.1109/34.598228
21 11/18 eigenfaces vs fisher slides by Guruprasad Somasundaram.
Spectral Clustering
x A Tutorial on Spectral Clustering
Ulrike von Luxburg
http://web.mit.edu/~wingated/www/introductions/tutorial_on_spectral_clustering.pdf
22 11/20 Spectral Clustering preliminary slides by Fanbin Bu.
Spectral Graph Partitioning
x Spectral Graph Partitioning
David Gleich, Leonid Zhukov, Kevin Lang
Slides: http://www.gg.caltech.edu/~zhukov/papers/Spectral_Graph_Partitioning.pdf
23 11/25 Spectral Graph Partitioning revised slides by Yun Liu.
x Co-clustering documents and words using Bipartite Spectral Graph Partitioning
Inderjit S. Dhillon
http://www.cs.utexas.edu/users/inderjit/public_papers/kdd_bipartite.pdf
(Co-clustering topic in this paper discussed subsequently.)
23 11/25 Bipartite Spectral Graph Partitioning preliminary slides by Liqiong Zhao.
Student Project Presentations (10 minutes each)
24 12/02 Student Projects _____Michael Karp_____________________
_____Shuo Guo_________________________
_____Guruprasad Somasundaram__________
_____Zhonghua Jiang___________________
_____Dong Jiao________________________
----------------------------------------
25 12/04 Student Projects _____Joshua Jorenby___________________
_____Liqiong Zhao_____________________
_____Jianxin Fang_____________________
_____Weikang Qian_____________________
_____Fanbin Bu________________________
_____Yun Liu__________________________
----------------------------------------
26 12/09 Student Projects _____Ravishankar______________________
_____Gang Fang________________________
_____Thanh Ngo________________________
_____Anoop Cherian____________________
_____John Navil Joseph________________
_____Shanzhen Chen____________________
_____Vijay & Gyan_____________________