Using HTTP GET requests, you can query for all kinds of historical data from the Fedora Messaging bus: events by username, by package, by message source, by topic… you name it.
Datagrepper is a web application and JSON API to retrieve historical messages sent via Fedora Messaging. Datanommer is a seperate project and service that consumes messages from the Fedora Messaging queue and puts them in a database. These messages is what datagrepper queries.
Datagrepper is curently running in production at https://apps.fedoraproject.org/datagrepper/
The list of possible topics can be found in Fedora Messaging's documentation.
As an alternative to cURL, HTTPie is a tool you can use to interact with a JSON API, like datagrepper. All examples in this guide use HTTPie. Use this command to install it on Fedora:
sudo dnf install httpie
HTTP GET arguments start
and end
can be either POSIX timestamps or
date/time strings.
Default value for end
is the current time unless both start
and
delta
are set.
datagrepper takes time arguments in seconds. So, we need to convert two days to 172,800 seconds first. Then, we can use HTTPie to get the JSON payload:
http get https://apps.fedoraproject.org/datagrepper/v2/search delta==172800
To get messages in fixed absolute time range, we can use a date/time string, for example the common ISO 8601 format:
http get https://apps.fedoraproject.org/datagrepper/v2/search end==2021-06-25T06:11:40+00:00 start==2021-06-25T06:11:39+00:00
The previous example is a large JSON response that's too big to read through. Limit the number of results to make it more digestable:
http get https://apps.fedoraproject.org/datagrepper/v2/search delta==172800 rows_per_page==1
{ "arguments": { "delta": 1728000.0, "end": 1366221938.0, "page": 1, "rows_per_page": 1, "order": "desc", "start": 1364493938.0, "topics": [], "agents": [], "categories": [], "users": [] "packages": [], "not_topics": [], "not_agents": [], "not_categories": [], "not_users": [] "not_packages": [], }, "count": 1, "pages": 2052, "raw_messages": [ ... ], "total": 2052 }
In this example, raw_messages
was omitted for readability.
Notice a few things.
start
and end
included (derived from your delta
)rows_per_page
shows the rows per page, its sibling value
page
is pointer to "page" of data you are onUse this command to get to the next page:
http get https://apps.fedoraproject.org/datagrepper/v2/search \ delta==172800 \ rows_per_page==1 \ page==2
{ "arguments": { "delta": 1728000.0, "end": 1366221938.0, "page": 2, "rows_per_page": 1, "order": "desc", "start": 1364493938.0, "topics": [], "agents": [], "categories": [], "users": [] "packages": [], "not_topics": [], "not_agents": [], "not_categories": [], "not_users": [] "not_packages": [], }, "count": 1, "pages": 2052, "raw_messages": [ ... ], "total": 2052 }
The number of rows are retrieved from newest to oldest ("descending"). The
order
argument lets you specify that. The default is desc
, but you can
set it to asc
for ascending order (i.e. oldest to newest).
Specify a category
to limit your message to one kind of topic:
http get https://apps.fedoraproject.org/datagrepper/v2/search \ delta==172800 \ category==bodhi
Here, category
is singular but comes back in the arguments
dict as
categories (plural)! You can specify multiple categories and messages that
match either category will return. They are OR
'd together:
http get https://apps.fedoraproject.org/datagrepper/v2/search \ delta==172800 \ category==bodhi \ category==wiki
Search for events relating to multiple users with this query:
http get https://apps.fedoraproject.org/datagrepper/v2/search \ delta==172800 \ user==toshio \ user==pingou
Same for packages:
http get https://apps.fedoraproject.org/datagrepper/v2/search \ delta==172800 \ package==nethack
For each positive filter, there is a corresponding negative filter. If you want to query all messages except for Koji messages, use this query:
http get https://apps.fedoraproject.org/datagrepper/v2/search \ delta==172800 \ not_category==buildsys
Positive and negative filters are combinable. This query returns all messages
except for user toshio
's Ask Fedora activity:
http get https://apps.fedoraproject.org/datagrepper/v2/search \ delta==172800 \ user==toshio \ not_category==askbot
Multiple category
, user
, and package
filters are merged together in
a way that looks like Conjunctive Normal Form (CNF).
The following query returns all messages from the past two days where (category==bodhi OR category==wiki) AND (user==toshio OR user==pingou):
http get https://apps.fedoraproject.org/datagrepper/v2/search \ delta==172800 \ category==bodhi \ category==wiki \ user==toshio \ user==pingou
If you get stuck, join #fedora-apps
on libera to ask questions. Or, if
everything is awesome, we welcome high-fives and karma cookies.