Talk:Chromosome structure via Euclidean Distance Matrices
From Wikimization
(→E.coli realization) |
|||
| Line 19: | Line 19: | ||
clear D Vn | clear D Vn | ||
| - | [evec evals flag] = eigs(VDV, [], 20, ' | + | [evec evals flag] = eigs(VDV, [], 20, 'LA'); |
if flag, disp('convergence problem'), return, end; | if flag, disp('convergence problem'), return, end; | ||
Revision as of 17:33, 7 August 2008
%%% Ronan Fleming, E.coli molecule data
%%% -Jon Dattorro, August 2008
clear all
load ecoli
frame = 4; % 1 through 12
G = her49imfs12movfull(frame).cdata; % uint8
G = (double(G)-128)/128; % Gram matrix
N = size(G,1);
D = diag(G)*ones(N,1)' + ones(N,1)*diag(G)' - 2*G; % EDM D
clear her49imfs12movfull G;
Vn = [-ones(1,N-1); speye(N-1)];
VDV = (-Vn'*D*Vn)/2;
clear D Vn
[evec evals flag] = eigs(VDV, [], 20, 'LA');
if flag, disp('convergence problem'), return, end;
close all
Xs = sqrt(real(evals(1:3,1:3)))*real(evec(:,1:3))'; % Projection of -VDV on PSD cone rank 3
plot3(Xs(1,:), Xs(2,:), Xs(3,:), '.')
E.coli realization
I regard the autocorrelation data you provided as a Gram matrix.
Then conversion to an EDM is straightforward - Chapter 5.4.2 of Convex Optimization & Distance Geometry.
The program calculates only the first 20 eigenvalues of the projection of the EDM on a positive semidefinite (PSD) cone.
You can see at runtime that there are many significant eigenvalues; which means, the Euclidean body (the molecule) lives in a space higher than dimension 3, assuming I have interpreted the E.coli data correctly.
To get a picture corresponding to physical reality, we project the EDM on that PSD cone, rank 3 subset; this means, precisely, we truncate eigenvalues.
It is unlikely that this picture is an accurate representation unless the number of eigenvalues of the EDM projection approaches 3 to begin with.
Matlab Figures allow 3D rotation in real time, so you can get a good idea of the body's shape.
I include a low-resolution figure here (frame 4) for reference.
