CSci 8363 -- Fall 2008 -- List of Papers

Tentative early list of papers to be presented during the semester.

General

x   Linear Algebra in Data Exploration - Examples

    01 09/02 Intro - Examples.                  "bio.pdf" - Examples of Linear Algebra in data Mining (Boley)

    02 09/04 Review of linear algebra           "intro.pdf" - Intriductory/Review slides

Basics of Eigenvalues, PCA definition

x   A tutorial on Principal Components Analysis
    Lindsey I Smith
    http://www.cs.otago.ac.nz/cosc453/student_tutorials/principal_components.pdf
    
    03 09/09 PCA + Elem Stat                    Boley

x   A Tutorial on Principal Component Analysis
    Jonathon Shlens
    http://www.snl.salk.edu/~shlens/pub/notes/pca.pdf
    (also "http://www.dgp.toronto.edu/~aranjan/tuts/pca.pdf")

    04 09/11 PCA by example, PCA<->SVD       "PCA.pdf ", by  Boley

Information retrieval

x   Computational Methods for Intelligent Information Access. M.W.
    Berry, S.T. Dumais, and T.A. Letsche.Proceedings of Supercomputing'95,
    San Diego, CA, December 1995.
    http://citeseer.ist.psu.edu/438263.html
    http://hpc.isti.cnr.it/~palmeri/datam/articles/SC95.ps

    05 09/16 Latent Semantic Indexing   slides by Vijay Kumar Adhikari

x   Latent semantic indexing via a semi-discrete matrix decomposition
    Tamara G. Kolda and Dianne P, O'Leary, in The Mathematics
    of Information Coding, Extraction and Distribution, G. Cybenko et al.,
    eds., vol. 107 of IMA Volumes in Mathematics and Its Applications.
    Springer-Verlag, 1999, pp. 73-80.
    http://portal.acm.org/citation.cfm?id=291131

    06 09/18 Semi-discrete matrix decomp.       slides by Joshua Jorensby

Clustering using vector-based similarities

x   Outlier Detection Using SemiDiscrete Decomposition (2002) 
    S. McConnell, D. Skillicorn
    http://citeseer.ist.psu.edu/mcconnell02outlier.html

    07 09/23 SDD outlier application    slides by Bryan Decaire

Eigenfaces (part I)

x   (Probabilistic Visual Learning for Object Representation, Baback
    Moghaddam, Alex Pentland Early Visual Learning, Oxford University
    Press, 1996.
    http://citeseer.ist.psu.edu/moghaddam96probabilistic.html

    08 09/25 Eigenfaces                 slides by Ravishankar Sivalingam

Clustering using vector-based similarities (continued) 

x   Concept Decompositions for Large Sparse Text Data using Clustering.
    I.S. Dhillon, D.S. Modha, IBM Research Report RJ 10147, July 8, 1999,
    Machine Learning, 42:1, pages 143-175, January 2001.
    http://www.cs.utexas.edu/users/inderjit/public_papers/concept_mlj.pdf

    09 09/30 Concept Decompositions     slides by Michael Karp

x   Concept Decomposition Using Clustering
    I.S. Dhillon, D.S. Modha
    Patent 6560597
    http://www.google.com/patents?id=ghQPAAAAEBAJ&dq=6560597.

    10 10/07 Concept Decomp: Patent             slides by Boley

x   Streaming Data Reduction Using Low-Memory Factored Representations.
    David Littau and Daniel Boley.
    Journal of Information Sciences, 176(14):2016-2041, Elsevier, 2006.
    http://www-users.cs.umn.edu/~boley/publications/papers/Streaming05.pdf

    ?? ??/?? Application: streaming data        _____EMPTY____________________

    xx 10/02 on travel

x   Video Google: A Text Retrieval Approach to Object Matching in Videos
    Sivic, J. and Zisserman, A.
    Proceedings of the International Conference on Computer Vision (2003) 
    http://www.robots.ox.ac.uk/~vgg/publications/papers/sivic03.pdf
    Demo: http://www.robots.ox.ac.uk/~vgg/research/vgoogle/index.html
    Slides: https://www.ipam.ucla.edu/publications/sews2/sews2_7272.pdf

    10 10/09 Application: images        slides by Weikang Qian

x   Fast Monte Carlo Algorithms for Matrices I: Approximating Matrix Multiplication
    Fast Monte Carlo Algorithms for Matrices II: Computing a Low Rank Approximation to a Matrix
    P. Drineas, R. Kannan, and M.W. Mahoney
    SIAM J Computing 2005
    I: http://www.cs.rpi.edu/~drinep/matrixI_SICOMP.pdf
    II: http://www.cs.rpi.edu/~drinep/matrixII_SICOMP.pdf
    Related Slides: http://www.cs.rpi.edu/~drinep/SDMtutorial.ppt

    11 10/14 Randomized Matrix Approx   slides by Dong Jiao
    (slides accessible only locally from 'umn.edu').

SVD/PCA in Biology -- Microarray analysis. (choose one)

x   Singular value decomposition for genome-wide expression data processing and modeling
    Orly Alter, Patrick O. Brown, David Botstein
    Proc Natl Acad Sci U S A. 2000 August 29; 97(18): 10101-10106.
    http://www.pubmedcentral.nih.gov/articlerender.fcgi?artid=27718

x   Example of how SVD/PCA are used to analyse data - using example from Biology
    Singular value decomposition and principal component analysis.
    Michael E.  Wall
    http://public.lanl.gov/mewall/kluwer2002.html
   
    12 10/16 PCA in Biology             slides by Gyan Ranjan.
                                        PDF version of slides.

x   (Generalized singular value decomposition for comparative analysis of genome-scale expression data sets of two different organisms
    Orly Alter, Patrick O. Brown, and David Botstein
    PNAS,  March 18, 2003,  vol. 100,  no. 6, pp 3351-3356 
    http://www.pnas.org/cgi/content/abstract/100/6/3351)

Linear discrinimant Analsys (LDA)

x   Fisher Linear Discriminant Analysis
    Max Wellman
    http://www.cs.huji.ac.il/~csip/Fisher-LDA.pdf

    13 10/21 Basic LDA shortintro       slides by Zhonghua Jiang.

    
x   Developmental Stage Annotation of Drosophila Gene Expression Pattern Images via an Entire Solution Path for LDA
    Jieping Ye, Jianhui Chen, Ravi Janardan, Sudhir Kumar
    http://doi.acm.org/10.1145/1342320.1342324

    14 10/23 LDA example use            slides by Jianxin Fang.

Multidimensional Scaling (one paper to be chosen)

x   MULTIDIMENSIONAL SCALING
    Forrest W. Young
    http://forrest.psych.unc.edu/teaching/p208a/mds/mds.html

x   Multidimensional Scaling
    Stephen P. Borgatti
    http://www.analytictech.com/borgatti/mds.htm

    15 10/28 MDS                        final slides by Anoop Cherian.

ISOMAP, Local Linear Embedding and visualization (one paper to be chosen) 

x   An Introduction to Locally Linear Embedding. Lawrence Saul & Sam
    Roweis. [draft version (Jan.01)]
    http://www.cs.toronto.edu/%7Eroweis/lle/papers/lleintro.pdf
    (also "http://www.cs.toronto.edu/~roweis/lle/publications.html")

    16 10/30 Isomap + Locally Linear Embedding  slides by Thanh Ngo.

Non-Negative Matrix Factorization

x   Sparse Non-negative Matrix Factorizations via Alternating
    Non-negativity-constrained Least Squares for Microarray Data Analysis,
    H Kim and H Park
    Bioinformatics, 23-12:1495-1502, 2007.
    http://www.cc.gatech.edu/~hpark/papers/kp07snmf.pdf

    17 11/04 NMF in micro array         slides by John Joseph.
 
Link Analysis -- PageRank. HITS

x   The PageRank Citation Ranking: Bringing Order to the Web.Page,
    Lawrence; Brin, Sergey; Motwani, Rajeev; Winograd, Terry. Stanford
    Univ. Computer Science Dept technical report. Oct. 2001
    http://citeseer.ist.psu.edu/page98pagerank.html

    18 11/06  PageRank idea             slides by Shuo Guo.

x   Deeper Inside PageRank
    Amy N. Langville and Carl D. Meyer
    Internet Mathematics Vol. 1, No. 3: 335-380
    http://projecteuclid.org/DPubS?service=UI&version=1.0&verb=Display&handle=euclid.im/1109190965
    http://projecteuclid.org/euclid.im/1109190965

    19 11/11 Inside Pagerank            revised slides by Shanzhen Chen.

x   Authoritative Sources in a Hyperlinked Environment
    Jon M. Kleinberg
    Proc. 9th ACM-SIAM Symposium on Discrete Algorithms, 1998.
    http://www.cs.cornell.edu/home/kleinber/auth.pdf

    20 11/13 HITS                       revised slides by Gang Fang.

Eigenfaces

x   (Probabilistic Visual Learning for Object Representation, Baback
    Moghaddam, Alex Pentland Early Visual Learning, Oxford University
    Press, 1996.
    http://citeseer.ist.psu.edu/moghaddam96probabilistic.html

    08 09/25 Eigenfaces                 _____________Ravishankar Sivalingam___

x   Eigenfaces vs. Fisherfaces: Recognition Using Class Specific Linear Projection.
    P. Belhumeur, J. Hespanha, and D. Kriegman (july 1997).
    IEEE Transactions on pattern analysis and machine intelligence 19 (7).
    http://dx.doi.org/10.1109/34.598228

    21 11/18 eigenfaces vs fisher       slides by Guruprasad Somasundaram.

Spectral Clustering

x   A Tutorial on Spectral Clustering
    Ulrike von Luxburg
    http://web.mit.edu/~wingated/www/introductions/tutorial_on_spectral_clustering.pdf

    22 11/20 Spectral Clustering        preliminary slides by Fanbin Bu.

Spectral Graph Partitioning

x   Spectral Graph Partitioning
    David Gleich, Leonid Zhukov, Kevin Lang
    Slides: http://www.gg.caltech.edu/~zhukov/papers/Spectral_Graph_Partitioning.pdf

    23 11/25 Spectral Graph Partitioning        revised slides by Yun Liu.

x   Co-clustering documents and words using Bipartite Spectral Graph Partitioning
    Inderjit S. Dhillon
    http://www.cs.utexas.edu/users/inderjit/public_papers/kdd_bipartite.pdf
    (Co-clustering topic in this paper discussed subsequently.)

    23 11/25 Bipartite Spectral Graph Partitioning      preliminary slides by Liqiong Zhao.

Student Project Presentations (10 minutes each)

    24 12/02 Student Projects         _____Michael Karp_____________________

                                      _____Shuo Guo_________________________

                                      _____Guruprasad Somasundaram__________

                                      _____Zhonghua Jiang___________________

                                      _____Dong Jiao________________________
   ----------------------------------------
    25 12/04 Student Projects         _____Joshua Jorenby___________________

                                      _____Liqiong Zhao_____________________

                                      _____Jianxin Fang_____________________

                                      _____Weikang Qian_____________________

                                      _____Fanbin Bu________________________

                                      _____Yun Liu__________________________
   ----------------------------------------
    26 12/09 Student Projects         _____Ravishankar______________________

                                      _____Gang Fang________________________

                                      _____Thanh Ngo________________________

                                      _____Anoop Cherian____________________

                                      _____John Navil Joseph________________

                                      _____Shanzhen Chen____________________

                                      _____Vijay & Gyan_____________________