SDD memory error & d-DNNF example ignoring #118

Zarach · 2024-06-13T15:46:46Z

Dear problog team,

I try to learn a Naive Bayes classifier for documents based on occuring words.

There are about 1000 examples and they should be classified (to be precise, the probability of the words, used at a class should be learned) like in your online example but to one of 4 classes.

If I run LFI with SDD a memory error occures.

If I run it with d-DNNF, it ignores nearly all of the given examples.
It runs into the following error because the calculated weight is very low, even at the beginning of the learning process:

    if self.semiring.is_zero(self._get_z()):
        raise InconsistentEvidenceError(context=" during evidence evaluation")

I guess this is the wanted behavior, but could you explain in an abstract way, why examples get ignored from the beginning?
Does it mean, that there is not enough information (not enough words used) in these examples to learn parameters?

The text was updated successfully, but these errors were encountered:

rmanhaeve · 2024-06-17T08:30:41Z

Hi Zarach

Could you perhaps give is an example of this behaviour?

Kind regards,
Robin

Zarach · 2024-06-17T15:09:49Z

Hi Robin,

not that easy, because I run it in python, but I'll try.
Here are 2 files (with 48 examples) which are examples for the ddnf problem which can be used in standalone mode. Hope it will be comparable to the python run.
All examples get ignored when I run with ddnnf.

For the python run I also uploaded a txt-file with the list of examples, in python it is done with the Term() objects which you can't see in the txt file.

examples_small.txt
program_small.txt
example_list.txt

And another example file which should show the memory alloc error for sdd:

examples_big.txt

Kind regards,
Benjamin

rmanhaeve · 2024-07-02T14:09:57Z

Hi Benjamin

It seems that you only give the training data, but there's no program attached. We'll need this as well to look into it in more depth.

Zarach · 2024-07-04T06:54:29Z

Hi Robin,

program_small.txt is the program which should be executable in problog standalone mode.
At least on my side this works and reproduces the problem.

rmanhaeve · 2024-07-05T11:44:54Z

I have solved the issue with the inconsistent evidence error by setting the initial probabilities to t(0.1) for all words, and by using log-space calculations, i.e. by running it with
lfi program_small.pl examples_small.pl --logspace

I'll now look into the memory issue

rmanhaeve · 2024-07-05T12:32:49Z

I have noticed a calloc when using SDDs. Have you tried using -k sddx ?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

SDD memory error & d-DNNF example ignoring #118

SDD memory error & d-DNNF example ignoring #118

Zarach commented Jun 13, 2024

rmanhaeve commented Jun 17, 2024

Zarach commented Jun 17, 2024

rmanhaeve commented Jul 2, 2024

Zarach commented Jul 4, 2024

rmanhaeve commented Jul 5, 2024

rmanhaeve commented Jul 5, 2024

SDD memory error & d-DNNF example ignoring #118

SDD memory error & d-DNNF example ignoring #118

Comments

Zarach commented Jun 13, 2024

rmanhaeve commented Jun 17, 2024

Zarach commented Jun 17, 2024

rmanhaeve commented Jul 2, 2024

Zarach commented Jul 4, 2024

rmanhaeve commented Jul 5, 2024

rmanhaeve commented Jul 5, 2024