#include namespace DB { FileLogDirectoryWatcher::FileLogDirectoryWatcher(const std::string & path_, StorageFileLog & storage_, ContextPtr context_) : path(path_) , storage(storage_) , log(&Poco::Logger::get("FileLogDirectoryWatcher(" + path + ")")) , dw(std::make_unique(*this, path, context_)) { } FileLogDirectoryWatcher::Events FileLogDirectoryWatcher::getEventsAndReset() { std::lock_guard lock(mutex); Events res; res.swap(events); return res; } FileLogDirectoryWatcher::Error FileLogDirectoryWatcher::getErrorAndReset() { std::lock_guard lock(mutex); Error old_error = error; error = {}; return old_error; } const std::string & FileLogDirectoryWatcher::getPath() const { return path; } void FileLogDirectoryWatcher::onItemAdded(DirectoryWatcherBase::DirectoryEvent ev) { std::lock_guard lock(mutex); EventInfo info{ev.event, "onItemAdded"}; std::string event_path = ev.path; if (auto it = events.find(event_path); it != events.end()) { it->second.file_events.emplace_back(info); } else { events.emplace(event_path, FileEvents{.file_events = std::vector{info}}); } } void FileLogDirectoryWatcher::onItemRemoved(DirectoryWatcherBase::DirectoryEvent ev) { std::lock_guard lock(mutex); EventInfo info{ev.event, "onItemRemoved"}; std::string event_path = ev.path; if (auto it = events.find(event_path); it != events.end()) { it->second.file_events.emplace_back(info); } else { events.emplace(event_path, FileEvents{.file_events = std::vector{info}}); } } /// Optimize for MODIFY event, during a streamToViews period, since the log files /// are append only, there are may a lots of MODIFY events produced for one file. /// For example, appending 10000 logs into one file will result in 10000 MODIFY event. /// So, if we record all of these events, it will use a lot of memory, and then we /// need to handle it one by one in StorageFileLog::updateFileInfos, this is unnecessary /// because it is equal to just record and handle one MODIY event void FileLogDirectoryWatcher::onItemModified(DirectoryWatcherBase::DirectoryEvent ev) { std::lock_guard lock(mutex); auto event_path = ev.path; EventInfo info{ev.event, "onItemModified"}; /// Already have MODIFY event for this file if (auto it = events.find(event_path); it != events.end()) { if (it->second.received_modification_event) return; else { it->second.received_modification_event = true; it->second.file_events.emplace_back(info); } } else { events.emplace(event_path, FileEvents{.received_modification_event = true, .file_events = std::vector{info}}); } } void FileLogDirectoryWatcher::onItemMovedFrom(DirectoryWatcherBase::DirectoryEvent ev) { std::lock_guard lock(mutex); EventInfo info{ev.event, "onItemMovedFrom"}; std::string event_path = ev.path; if (auto it = events.find(event_path); it != events.end()) { it->second.file_events.emplace_back(info); } else { events.emplace(event_path, FileEvents{.file_events = std::vector{info}}); } } void FileLogDirectoryWatcher::onItemMovedTo(DirectoryWatcherBase::DirectoryEvent ev) { std::lock_guard lock(mutex); EventInfo info{ev.event, "onItemMovedTo"}; std::string event_path = ev.path; if (auto it = events.find(event_path); it != events.end()) { it->second.file_events.emplace_back(info); } else { events.emplace(event_path, FileEvents{.file_events = std::vector{info}}); } } void FileLogDirectoryWatcher::onError(Exception e) { std::lock_guard lock(mutex); LOG_ERROR(log, "Error happened during watching directory: {}", error.error_msg); error.has_error = true; error.error_msg = e.message(); } }