-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy pathTRAIN_00274.eml
21 lines (17 loc) · 1 KB
/
TRAIN_00274.eml
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
NoneNone[Spambayes] Ditching WordInfo> Yeah, that's exactly what I was doing--I didn't realize I was
> incurring administrative pickle bloat this way. I'm specifically
> trying to make things faster and smaller, so I'm storing individual
> WordInfo pickles into an anydbm dict (keyed by token). The result
> is that it's almost 50 times faster to score messages one per run
> our of procmail (.408s vs 18.851s).
This is very worthwhile.
> However, it *does* say all over the place that the goal of this
> project isn't to make the fastest or the smallest implementation, so
> I guess I'll hold off doing any further performance tuning until the
> goal starts to point more in that direction. .4 seconds is probably
> fast enough for people to use it in their procmailrc, which is what
> I was after.
Maybe. I batch messages using fetchmail (don't ask why), and adding
.4 seconds per message for a batch of 50 (not untypical) feels like a
real wait to me...
--Guido van Rossum (home page: http://www.python.org/~guido/)