A critical error when working with POLL and EPOLL?

In addition to the previous [issue](https://github.com/sipwise/rtpengine/issues/1935).

https://github.com/sipwise/rtpengine/blob/fdcee65ab73bbaec4495ad9e936c8246c5ef9a74/daemon/media_socket.c#L3120-L3126

After adding changes to this source code, the functionality for resetting active_read_events and error_strikes to 0, 
the problem was not completely resolved.


There is a suggestion of a critical error in the current implementation of the poller_poll() and epoll_events() 
functions.

https://github.com/sipwise/rtpengine/blob/76dd9ab56d3f3e34ab22c0bbfbfddbf0686e7f40/lib/poller.c#L84-L90

The EPOLLET flag has been added to the implementation of the [epoll_events()](https://github.com/sipwise/rtpengine/blob/76dd9ab56d3f3e34ab22c0bbfbfddbf0686e7f40/lib/poller.c#L84) function. However, the [poller_pool()](https://github.com/sipwise/rtpengine/blob/76dd9ab56d3f3e34ab22c0bbfbfddbf0686e7f40/lib/poller.c#L180) 
function analyzes the polling constants (POLLERR, POLLHUP), which is incorrect. As a result, frozen RTP sessions 
appear 10-15 seconds after the call. Removing the EPOLLET solves the problem of hanging, but it leads to CPU 
overload (constant uncontrolled wakeups).

What if you tweak the epoll_event() and poller_poll() functions, change the POLL constants to EPOLL, this will lead to the 
absence of "frozen sessions", low CPU load, and correct operation with epoll constants.

New version of epoll_events():

```.cpp
    return EPOLLHUP | EPOLLERR | EPOLLET | EPOLLRDHUP | EPOLLPRI |
		  ((it->writeable && ii && ii->blocked) ? EPOLLOUT | EPOLLWRNORM | EPOLLWRBAND : 0) |
		  (it->readable ? EPOLLIN |  EPOLLRDNORM | EPOLLRDBAND : 0);
```
		  
And in the new version of the poller_poll() function, analyze these flags:

```.cpp 
... 
if (ev->events & (EPOLLERR | EPOLLHUP | EPOLLRDHUP))
  it->item.closed(it->item.fd, it->item.obj);
else {
  if (ev->events & (EPOLLIN | EPOLLPRI | EPOLLRDNORM | EPOLLRDBAND)) {
    if (it->item.readable)
      it->item.readable(it->item.fd, it->item.obj);
  }
  else
    if (ev->events & (EPOLLOUT | EPOLLWRNORM | EPOLLWRBAND)) {
      mutex_lock(&p->lock);
      it->blocked = 0;

      ZERO(e);
      ...
    }
...
```

What do you think it's correct fix or not?

Thanks in advance!

	if (strikes >= MAX_RECV_LOOP_STRIKES) {
	ilog(LOG_WARN \| LOG_FLAG_LIMIT, "UDP receive queue exceeded %i times: "
	"discarding packet", strikes);
	// Polling is edge-triggered so we won't immediately get here again.
	// We could remove ourselves from the poller though. Maybe call stream_fd_closed?
	return;
	}

	static int epoll_events(struct poller_item it, struct poller_item_int ii) {
	if (!it)
	it = &ii->item;
	return EPOLLHUP \| EPOLLERR \| EPOLLET \|
	((it->writeable && ii && ii->blocked) ? EPOLLOUT : 0) \|
	(it->readable ? EPOLLIN : 0);
	}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

A critical error when working with POLL and EPOLL? #1939

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

A critical error when working with POLL and EPOLL? #1939

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions