kernel technology that allows developers to write custom codes that can be loaded into the kernel dynamically.
it's used for:
- performance tracing
- networking
- detecting malicious activity
BPF stands for Berkeley Packet Filterting
, first described in 1993 paper written by Berkeley.
ldh [12]
jeq #ETHERTYPE IP, L1, L2
L1: ret #TRUE
L2: ret #0
we can load the eBPF dynamically, no need to reboot, rebuild
it evolves to what we call - eBPF, when kernel version 3.18 in 2014.
- kernel: software layer between our application runs (user space) and the hardware.
- user application cannot accesss hw directly, we're accessing the hw via the kernel,
- r/w files
- accessing memory
- r/s network packets
By using eBPF, we can intercept the system call from userspace applications (ex. what file application try to r/w)
eBPF is very useful for the Cloud-Native environment, these days many applications are running on the Cloud Services, that is, Fargate, GKE etc. these workload is working on the server at last, and if we apply the eBPF to the workload, we can observer the behaviours of all workloads that running on the same server.
as similar technology, service mesh, sidecar container exist, but they have some downside:
- the application pod has to be restarted for the sidecar to be added
- we may need to modify YAML configs to apply sidecar
- performance issue as below:
#!/usr/bin/python
from bcc import BPF
program = r"""
int hello(void *ctx) {
bpf_trace_printk("Hello World!");
return 0;
}
"""
b = BPF(text=program)
syscall = b.get_syscall_fnname("execve")
b.attach_kprobe(event=syscall, fn_name="hello")
b.trace_print()
here's the source code written in C:
int hello(void *ctx) {
bpf_trace_printk("Hello World!");
return 0;
}
all the eBPF program does is use a helper funtion, bpf_trace_printk()
to write a message
then,
syscall = b.get_syscall_fnname("execve")
b.attach_kprobe(event=syscall, fn_name="hello")
at here, we're getting the target system probe and create trap fot it via kprobe .
at this point, eBPF program is loaded into the kernel.
after that, by b.trace_print()
, we can print the output from the kernel.
execute sample:
data:image/s3,"s3://crabby-images/2825f/2825fda34131f13f752ac9806c560368184aacbe" alt="image"
map is a data structure that can be accessed from the eBPF/userspace applications. it's used for:
- storing configuration data
- communicate between other eBPF
- store results/metrics for let userspace application read them
https://github.com/kubosuke/ebpf-playground/blob/main/counter.py