Watchtower + Multiprocessing #31

Open · Audace opened this issue Nov 24, 2016 · 10 comments


Audace commented Nov 24, 2016

The logger is not successfully writing to CloudWatch when using multiprocessing. I tested whether this was my configuration by dropping the watchtower handler and using a file handler instead; that logged perfectly. However, when switching back to the watchtower handler, only the messages logged before and after outputs = pool.map(worker, inputs) made it to CloudWatch.

Any idea how to fix this? Setting use_queues to True didn't help.

Sample code:

import watchtower
import logging
from multiprocessing import Pool  # only Pool is used below

def worker(var):
    logger.debug("Incoming variable: %s" % var)
    logger.debug("Outgoing variable: %s" % (var+1))
    return var+1

def main():
    inputs = []
    for i in xrange(1000):
        inputs.append(i)

    logger.debug("Starting run now!")
    pool = Pool(processes=3)
    outputs = pool.map(worker, inputs)
    pool.close()
    pool.join()
    logger.debug("Just finished run")

if __name__ == "__main__":
    logger = logging.getLogger("multi")
    logger.setLevel(logging.DEBUG)

    fh = logging.FileHandler("test.log")
    fh.setLevel(logging.DEBUG)
    logger.addHandler(fh)

    wt_project_handler = watchtower.CloudWatchLogHandler(stream_name="test",
                                                         use_queues=True)
    wt_project_handler.setLevel(logging.DEBUG)
    logger.addHandler(wt_project_handler)

    main()

Audace commented Nov 24, 2016

This now works when I set use_queues to False. However, I now get the following two errors:

TypeError: ('__init__() takes exactly 3 arguments (2 given)',
<class 'botocore.exceptions.ClientError'>, (u'An error occurred (InvalidSequenceTokenException)
when calling the PutLogEvents operation: The given sequenceToken is invalid. The next expected
sequenceToken is: 49566986361376648647772148512500932485923247621329154002',))

and

An error occurred (ThrottlingException) when calling the PutLogEvents operation (reached max
retries: 4): Rate exceeded


Audace commented Nov 25, 2016

Solved this issue by using this repo: https://github.com/jruere/multiprocessing-logging, which was spun out of this post: http://stackoverflow.com/questions/641420/how-should-i-log-while-using-multiprocessing-in-python.

All it required was importing multiprocessing_logging and then adding multiprocessing_logging.install_mp_handler(logger).
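Roughly, the wiring looks like this; install_mp_handler has to run after the handlers are attached and before the pool starts (a sketch based on the library's README):

import logging

import watchtower
import multiprocessing_logging

logger = logging.getLogger("multi")
logger.setLevel(logging.DEBUG)
logger.addHandler(watchtower.CloudWatchLogHandler(stream_name="test"))

# Wraps each handler on the logger so records emitted in child
# processes travel over an internal queue and are handled in the parent.
multiprocessing_logging.install_mp_handler(logger)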

Audace closed this as completed Nov 25, 2016
kislyuk reopened this Mar 26, 2017

kislyuk commented Mar 26, 2017

Thanks, I probably need to add docs on how to deal with this, so reopening this issue to keep track of that.

@spetoolio

Just experienced this issue, thanks for the fix @Audace!

@kislyuk it may be worth updating the docs to include a reference to that library. I'm working with django-rq, and any logging within a worker process was not making it into the watchtower batch. I'd assume other worker libraries like Celery would hit this issue as well. Similar to @Audace's answer, I updated my Django app's ready() function, guaranteeing that install_mp_handler() is called after logging is set up and before any logs are sent:

from django.apps import AppConfig

class MyAppConfig(AppConfig):
    name = 'MyApp'

    def ready(self):
        import multiprocessing_logging
        import logging
        multiprocessing_logging.install_mp_handler(logging.getLogger("my_apps_primary_logger"))


spetoolio commented Dec 3, 2018

Just wanted to add that although this seemed solved, I still encounter situations where logging suddenly stops reaching CloudWatch entirely while continuing to log successfully to local files. I see no obvious cause, and after restarting my workers everything is back to normal as if nothing was wrong. If anyone has any thoughts, please feel free to share, and I will update if I come across a solution. @Audace not sure if you saw anything similar after implementing your solution?

@redixhumayun

@kislyuk Is this the suggested way to deal with multi-process logging for this library?


kislyuk commented Jan 1, 2021

The suggested way to use logging in multiprocessing pools is to share nothing. Use one logger per worker process (or thread) and initialize the logger after forking. A shared logger will not work correctly with multiprocessing due to the stateful nature of the logger and race conditions that will arise between different copies of the logger in the different processes.
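A sketch of that pattern with a Pool initializer (the stream-naming scheme here is just an illustration):

import logging
import multiprocessing

import watchtower

def init_worker():
    # Runs once in each child process, after the fork: each worker gets
    # its own logger and its own CloudWatch handler state.
    logger = logging.getLogger("worker")
    logger.setLevel(logging.DEBUG)
    logger.addHandler(watchtower.CloudWatchLogHandler(
        stream_name="worker-%s" % multiprocessing.current_process().name))

def work(item):
    logging.getLogger("worker").debug("processing %s", item)
    return item + 1

if __name__ == "__main__":
    pool = multiprocessing.Pool(processes=3, initializer=init_worker)
    results = pool.map(work, range(10))
    pool.close()
    pool.join()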

@redixhumayun

@kislyuk So, I'm not using a multiprocessing pool, just manually creating all the child processes I want.

If I pass a separate logger instance to each child process, does each logger instance log to a separate file / stream? Is there a way to collate everything based on some metric (like timestamp)?


kislyuk commented Jan 2, 2021

@redixhumayun that is for you to configure. Check the project documentation for how to configure log stream names, and the CloudWatch documentation for how to collate logs.

@redixhumayun

I wanted to post my solution here, just in case it is useful to anyone else.

The Python Logging Cookbook has some great examples of how to log to a single file from multiple processes; a sample can be found here.

The sample linked above makes use of the QueueHandler class, which accepts log records from multiple processes and hands them off to be emitted somewhere else. Since it is a queue, log order is maintained.

Integrating with watchtower is as easy as adding an additional handler at the output end of the queue.

You can use the sample code from the link above and just modify the listener_configurer function as below:

import logging
import logging.handlers
import watchtower

def listener_configurer():
    # Runs in the single listener process: attach both the rotating
    # file handler and the watchtower handler to the root logger.
    root = logging.getLogger()
    h = logging.handlers.RotatingFileHandler('mptest.log', 'a', 300, 10)
    watchtower_handler = watchtower.CloudWatchLogHandler()
    f = logging.Formatter('%(asctime)s %(processName)-10s %(name)s %(levelname)-8s %(message)s')
    h.setFormatter(f)
    watchtower_handler.setFormatter(f)
    root.addHandler(h)
    root.addHandler(watchtower_handler)
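For completeness, the other side of the queue follows the cookbook pattern roughly like this (worker processes attach only a QueueHandler, so they never contend for CloudWatch sequence tokens):

import logging
import logging.handlers

def worker_configurer(queue):
    # Workers send records into the shared queue instead of talking to
    # CloudWatch or the file directly.
    root = logging.getLogger()
    root.addHandler(logging.handlers.QueueHandler(queue))
    root.setLevel(logging.DEBUG)

def listener_process(queue):
    # The single listener owns the file and watchtower handlers.
    listener_configurer()
    while True:
        record = queue.get()
        if record is None:  # sentinel from the main process on shutdown
            break
        logging.getLogger(record.name).handle(record)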
