[WIP] Diffuse #29 (Open)

wants to merge 40 commits into master
Conversation

tjlane (Owner) commented Feb 22, 2017

Adding C++ code for computing the MVN diffuse scatter model.

$ time ./cputest
10000 q-vectors :: 1000 atoms
remember: linear in q-vectors, quadratic in atoms
CPP OUTPUT:
0.000000
0.000000

real 0m57.507s
user 0m56.895s
sys 0m0.313s

tjlane commented Feb 22, 2017

@apeck12 here is how this works

You can access the python interface to the new code by doing

from thor import _cppscatter
_cppscatter.cpp_scatter_diffuse(xyzlist, q_grid, atom_types, cromermann_parameters, V)

The (cython-provided) python interface is here:
https://github.com/tjlane/thor/blob/diffuse/src/scatter/cpp_scatter_wrap.pyx#L384

The C++ code implementing the (almost certainly incorrect) correlation matrix is here:
https://github.com/tjlane/thor/blob/diffuse/src/scatter/cpp_scatter.cpp#L444
https://github.com/tjlane/thor/blob/diffuse/src/scatter/cpp_scatter.cpp#L459

Note that V is a huge array representing the tensor V_ij^ab. In python, as indicated in the docstring in cpp_scatter_wrap.pyx,

        V : ndarray, float
            A 4-d array of shape (n_atoms, n_atoms, 3, 3) representing the
            anisotropic Gaussian correlation between atoms. The value
            V[i,j,a,b] is the correlation between atom i in the a direction with
            j in the b direction, where a & b are one of {x/y/z}.

In C-land, this will be accessed as a flattened array, so V[i,j,a,b] in python should correspond to V[9*(num_atoms * i + j) + 3*a + b]. At least that's what I think -- double check me! And feel free to fix up that part of the c-code.
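
That index mapping can be sanity-checked against NumPy's C-order flattening of an (n_atoms, n_atoms, 3, 3) array; the values below are arbitrary test integers, not a physical V:

```python
# Verify the documented flattening: V[i,j,a,b] in python should map to
# V[9*(num_atoms*i + j) + 3*a + b] in the flat C array. Values are
# arbitrary, used only to check the index arithmetic.
import numpy as np

num_atoms = 4
V = np.arange(num_atoms * num_atoms * 9, dtype=float).reshape(num_atoms, num_atoms, 3, 3)
V_flat = V.ravel()  # C-contiguous (row-major) flattening, as the C++ code sees it

for i in range(num_atoms):
    for j in range(num_atoms):
        for a in range(3):
            for b in range(3):
                assert V[i, j, a, b] == V_flat[9 * (num_atoms * i + j) + 3 * a + b]
print("flat index formula matches C-order layout")
```

So as long as the python-side array is C-contiguous (the numpy default), the formula above lines up with the C++ code's view of the memory.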

tjlane commented Feb 22, 2017

@apeck12 regarding testing & what works now:

What works: If you pass a V array of all zeros, right now you do get the same result as a reference implementation. This is tested here:
https://github.com/tjlane/thor/blob/diffuse/test/scatter/test_scatter.py#L368

Adding new test: Add your reference implementation to the top of the test_scatter.py file and then add a new test in the TestDiffuseScatter class that uses the V tensor.

If you add stuff and it doesn't work right off the bat, push the new test and I can help make the code correct.

tjlane commented Feb 22, 2017

TODO:

  • test correctness of code/implement test
  • provide a high level python interface
  • benchmark to see if we meet performance requirements

tjlane commented Feb 23, 2017

@apeck12 benchmark for a "large" system and one q-vector

$ time ./cputest 
1 q-vectors :: 15000 atoms
remember: linear in q-vectors, quadratic in atoms
CPP OUTPUT:
0.000000
0.000000

real	0m4.248s
user	0m1.917s
sys	0m1.855s

For 10,000,000 q-vectors, we're talking ~11,000 cpu-hours, which is too many! So we need to keep thinking. Note parallelization over q-vecs is trivial.
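
That figure follows directly from the timing above; a quick back-of-envelope check, using the measured wall time for one q-vector and the linear scaling in q-vectors:

```python
# Extrapolate the single-q-vector benchmark (15,000 atoms, ~4.2 s wall)
# to 10,000,000 q-vectors, using the linear-in-q-vectors scaling.
seconds_per_qvec = 4.248          # measured above for 1 q-vector
n_qvecs = 10_000_000
total_cpu_hours = seconds_per_qvec * n_qvecs / 3600.0
print(round(total_cpu_hours))     # prints 11800, i.e. the quoted ~11,000 cpu-hours
```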

tjlane commented Feb 23, 2017

Going to the isotropic approximation seems to make little difference. Going to GPU is possible/easy.

apeck12 (Collaborator) commented Feb 23, 2017

I've been using the GPU code exclusively for the Thor CypA and ECR simulations. Do you mind if I switch the test case for TestDiffuseScatter from the 512_atom_benchmark.xyz to pentagon.pdb?

apeck12 commented Feb 23, 2017

Also, is there a Cython wrapper for the calculation with isotropic V?

tjlane commented Feb 23, 2017

@apeck12

  1. 512_atom_benchmark.xyz to pentagon.pdb: go for it
  2. There is not (and probably never will be) a specific isotropic interface. I committed a sketch of a python interface just now (see scatter.py, commit above). Basically the trick will be to fill in a sparse V matrix for the isotropic case. If we could get a big speedup from isotropic-specific code we would write it, but my tests so far indicate that is not the case.
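
A minimal sketch of that trick, assuming "isotropic" means each 3x3 block of V is a scalar pair-correlation times the identity; the per-pair scalars `gamma` below are placeholder values, not a physical model:

```python
# Build an isotropic V tensor: V[i,j,a,b] = gamma[i,j] * delta_ab, so each
# (3, 3) block is diagonal and the general anisotropic code reduces to the
# isotropic case. gamma is a placeholder symmetric correlation matrix.
import numpy as np

n_atoms = 5
rng = np.random.default_rng(0)
gamma = rng.random((n_atoms, n_atoms))
gamma = 0.5 * (gamma + gamma.T)          # correlations symmetric in i <-> j

V = gamma[:, :, None, None] * np.eye(3)  # broadcast (n, n, 1, 1) * (3, 3)
assert V.shape == (n_atoms, n_atoms, 3, 3)
```

A V built this way can then be passed straight through the general anisotropic interface.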

apeck12 commented Feb 24, 2017

@tjlane

The Python reference implementation is operational in pull request #31.

For the pentagon tests, _cppscatter.cpp_scatter_diffuse

  • produces correct output when V=0, as you had shown for 512_atom_benchmark.xyz.
  • fails for the test non-zero V matrix, with error: 'Infs detected in scattering output!'

tjlane commented Feb 24, 2017

@apeck12 cool. I will work on this and fix things up. Thanks!

tjlane commented Mar 8, 2017

@apeck12 latest commit includes functional C++ code splitting diffuse/bragg. Speed improvements to come, but you can start using it if you want. Current best estimate for 1M q-vectors & 1500 atoms is ~30,000 cpu-seconds (8 hrs).
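
As a rough cross-check, applying the scaling law quoted earlier in the thread (linear in q-vectors, quadratic in atoms) to the 1-q-vector / 15,000-atom benchmark above lands in the same ballpark:

```python
# Scale the earlier benchmark (1 q-vector, 15,000 atoms, ~4.2 s wall) down
# to 1,500 atoms (quadratic in atoms) and up to 1,000,000 q-vectors
# (linear in q-vectors).
per_q_15000_atoms = 4.248                            # seconds, measured above
per_q_1500_atoms = per_q_15000_atoms * (1500 / 15000) ** 2
estimate_seconds = per_q_1500_atoms * 1_000_000
print(round(estimate_seconds))                       # ~42,000 s, same order as the quoted ~30,000 s
```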

Since we are in a rapidly changing development mode but have decent tests, I recommend checking out the latest copy and always running the tests (at least test_scatter.py) before doing any calculations we want to trust.

apeck12 commented Mar 10, 2017

@tjlane
Receiving the following error when trying to run scatter.simulate_diffuse:
Error in kernel. CUDA error: invalid argument

This is my call to the Thor code:
scatter.simulate_diffuse(pdb, detector, V, ignore_hydrogens=True, device_id=0)

Any thoughts on what might be up? I think it's the same version of CUDA that I was using on the master branch without issue.

tjlane commented Mar 10, 2017

@apeck12
As a quick check, see whether passing ignore_hydrogens=False fixes the problem.
If it doesn't, can you send me the entire example so I can replicate the problem?

2 participants