Uring
Io_uring is an asynchronous I/O API for Linux that uses ring buffers shared between the Linux kernel and userspace to provide an efficient mechanism to batch requests that can be handled asynchronously and in parallel. This module provides an OCaml interface to io_uring that aims to provide a thin type-safe layer for use in higher-level interfaces.
module Region : sig ... end
Region
handles carving up a block of external memory into smaller chunks. This is currently just a slab allocator of a fixed size, on the basis that most IO operations operate on predictable chunks of memory. Since the block of memory in a region is contiguous, it can be used in Uring's fixed buffer model to map it into kernel space for more efficient IO.
A handle for a submitted job, which can be used to cancel it. If an operation returns None
, this means that submission failed because the ring is full.
val create : ?polling_timeout:int -> queue_depth:int -> unit -> 'a t
create ~queue_depth
will return a fresh Io_uring structure t
. Initially, t
has no fixed buffer. Use set_fixed_buffer
if you want one.
val queue_depth : 'a t -> int
queue_depth t
returns the total number of submission slots for the uring t
val exit : 'a t -> unit
exit t
will shut down the uring t
. Any subsequent requests will fail.
Each uring may have associated with it a fixed region of memory that is used for the "fixed buffer" mode of io_uring to avoid data copying between userspace and the kernel.
val set_fixed_buffer :
'a t ->
Cstruct.buffer ->
(unit, [> `ENOMEM ]) Stdlib.result
set_fixed_buffer t buf
sets buf
as the fixed buffer for t
.
You will normally want to wrap this with Region.alloc
or similar to divide the buffer into chunks.
If t
already has a buffer set, the old one will be removed.
Returns `ENOMEM
if insufficient kernel resources are available or the caller's RLIMIT_MEMLOCK resource limit would be exceeded.
val buf : 'a t -> Cstruct.buffer
buf t
is the fixed internal memory buffer associated with uring t
using set_fixed_buffer
, or a zero-length buffer if none is set.
timeout t clock ns d
submits a timeout request to uring t
.
absolute
denotes how clock
and ns
relate to one another. Default value is false
ns
is the timeout time in nanoseconds
module type FLAGS = sig ... end
module Open_flags : sig ... end
Flags that can be passed to openat2
.
val openat2 :
'a t ->
access:[ `R | `W | `RW ] ->
flags:Open_flags.t ->
perm:Unix.file_perm ->
resolve:Resolve.t ->
?fd:Unix.file_descr ->
string ->
'a ->
'a job option
module Linkat_flags : sig ... end
val linkat :
'a t ->
?old_dir_fd:Unix.file_descr ->
?new_dir_fd:Unix.file_descr ->
flags:Linkat_flags.t ->
old_path:string ->
new_path:string ->
'a ->
'a job option
linkat t ~flags ~old_path ~new_path
creates a new hard link.
If new_path
already exists then it is not overwritten.
unlink t ~dir ~fd path
removes the directory entry path
, which is resolved relative to fd
. If fd
is not given, then the current working directory is used. If path
is a symlink, the link is removed, not the target.
module Poll_mask : sig ... end
val poll_add : 'a t -> Unix.file_descr -> Poll_mask.t -> 'a -> 'a job option
poll_add t fd mask d
will submit a poll(2)
request to uring t
. It completes and returns d
when an event in mask
is ready on fd
.
For files, give the absolute offset, or use Optint.Int63.minus_one
for the current position. For sockets, use an offset of Optint.Int63.zero
(minus_one
is not allowed here).
val read_fixed :
'a t ->
file_offset:offset ->
Unix.file_descr ->
off:int ->
len:int ->
'a ->
'a job option
read t ~file_offset fd ~off ~len d
will submit a read(2)
request to uring t
. It reads up to len
bytes from absolute file_offset
on the fd
file descriptor and writes the results into the fixed memory buffer associated with uring t
at offset off
. The user data d
will be returned by wait
or peek
upon completion.
val read_chunk :
?len:int ->
'a t ->
file_offset:offset ->
Unix.file_descr ->
Region.chunk ->
'a ->
'a job option
read_chunk
is like read_fixed
, but gets the offset from chunk
.
val write_chunk :
?len:int ->
'a t ->
file_offset:offset ->
Unix.file_descr ->
Region.chunk ->
'a ->
'a job option
write_chunk
is like write_fixed
, but gets the offset from chunk
.
splice t ~src ~dst ~len d
will submit a request to copy len
bytes from src
to dst
. The operation returns the number of bytes transferred, or 0 for end-of-input. The result is EINVAL
if the file descriptors don't support splicing.
module Statx : sig ... end
val statx :
'a t ->
?fd:Unix.file_descr ->
mask:Statx.Mask.t ->
string ->
Statx.t ->
Statx.Flags.t ->
'a ->
'a job option
statx t ?fd ~mask path stat flags
stats path
, which is resolved relative to fd
(or the current directory if fd
is not given).
connect t fd addr d
will submit a request to connect fd
to addr
.
val accept : 'a t -> Unix.file_descr -> Sockaddr.t -> 'a -> 'a job option
accept t fd addr d
will submit a request to accept a new connection on fd
. The new FD will be configured with SOCK_CLOEXEC
. The remote address will be stored in addr
.
cancel t job d
submits a request to cancel job
. The cancel job itself returns 0 on success, or ENOTFOUND
if job
had already completed by the time the kernel processed the cancellation request.
module Msghdr : sig ... end
val send_msg :
?fds:Unix.file_descr list ->
?dst:Unix.sockaddr ->
'a t ->
Unix.file_descr ->
Cstruct.t list ->
'a ->
'a job option
send_msg t fd buffs d
will submit a sendmsg(2)
request. The Msghdr
will be constructed from the FDs (fds
), address (dst
) and buffers (buffs
).
Requires List.length buffs <= Uring.iov_max
recv_msg t fd msghdr d
will submit a recvmsg(2)
request. If the request is successful then the msghdr
will contain the sender address and the data received.
fsync t ?off ?len fd d
will submit an fsync(2)
request, with the optional offset off
and length len
specifying the subset of the file to perform the synchronisation on.
fdatasync t ?off ?len fd d
will submit an fdatasync(2)
request, with the optional offset off
and length len
specifying the subset of the file to perform the synchronisation on.
You can check which operations are supported by the running kernel.
module Op : sig ... end
val submit : 'a t -> int
val wait : ?timeout:float -> 'a t -> 'a completion_option
wait ?timeout t
will block indefinitely (the default) or for timeout
seconds for any outstanding events to complete on uring t
. This calls submit
automatically.
val get_cqe_nonblocking : 'a t -> 'a completion_option
get_cqe_nonblocking t
returns the next completion entry from the uring t
. It is like wait
except that it returns None
instead of blocking.
val peek : 'a t -> 'a completion_option
val register_eventfd : 'a t -> Unix.file_descr -> unit
register_eventfd t fd
will register an eventfd to the the uring t
. See documentation for io_uring_register_eventfd
error_of_errno e
converts the error code abs e
to a Unix error type.
val active_ops : _ t -> int
active_ops t
returns the number of operations added to the ring (whether submitted or not) for which the completion event has not yet been collected.
val sqe_ready : _ t -> int
sqe_ready t
is the number of unconsumed (if SQPOLL) or unsubmitted entries in the SQ ring.
module Stats : sig ... end
get_debug_stats t
collects some metrics about the internal state of t
.
module Private : sig ... end