Data source Info / Ref Notes

Some Notes on Data source Info / Data source Ref:

Terms I'm using for these notes:
- DatasourceRef: a model that includes (uid + type). It is saved by dashboards and alerting (note: alerting currently saves the UID only).
- DatasourceInfo: the full information that a data source needs to execute a request. It can be looked up by the Ref.
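
To make the two terms concrete, here is a minimal Go sketch. Everything beyond `uid` and `type` is an assumption for illustration: the `DatasourceInfo` fields, the `InfoStore` type, and its `Lookup` method are hypothetical names, not existing Grafana types.

```go
package main

import "fmt"

// DatasourceRef is the small reference that gets saved (uid + type).
type DatasourceRef struct {
	UID  string `json:"uid"`
	Type string `json:"type,omitempty"` // may be empty where only the UID is saved
}

// DatasourceInfo is the full information needed to execute a request.
// The fields here are hypothetical placeholders.
type DatasourceInfo struct {
	Ref     DatasourceRef
	URL     string
	Secrets map[string]string // decrypted secrets; only some callers should see these
}

// InfoStore is a hypothetical in-memory lookup keyed by UID.
type InfoStore map[string]DatasourceInfo

// Lookup resolves a Ref to the full Info using only the UID.
func (s InfoStore) Lookup(ref DatasourceRef) (DatasourceInfo, error) {
	info, ok := s[ref.UID]
	if !ok {
		return DatasourceInfo{}, fmt.Errorf("data source %q not found", ref.UID)
	}
	return info, nil
}

func main() {
	store := InfoStore{
		"abc123": {
			Ref: DatasourceRef{UID: "abc123", Type: "prometheus"},
			URL: "http://localhost:9090",
		},
	}
	// A UID-only ref (as alerting saves it) is enough for the lookup.
	info, err := store.Lookup(DatasourceRef{UID: "abc123"})
	fmt.Println(info.URL, err)
}
```

The point of the split is that the Ref is cheap to store and pass around, while the Info (with secrets) only needs to exist where a request is actually executed.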
Contexts:
- Alerting: The Ref is saved. Alerts run as a service, so in this context there is no user (although perhaps there should be a service user or role). Therefore, for saved alerts run by the service, authorization is done at creation time, not at run time (users would not want alerts to stop working if a user is removed).
- Query API: Authorization is done at run time; in this context there is a user.
Main Question:
- Which services are responsible for what in data source execution?
- Where the trailing "what" is:
  - Authorizing a User (or Service) to a data source (or data source ref?), and who caches this?
  - Taking a data source ref and getting data source info
  - Decryption of secrets within data source info?
Considerations:
- Security (e.g. Defense in depth, services not having secrets they don't need)
- Performance (e.g. not looking up things twice)
- Making things services - and the stateless aspect (TBH "stateless" in this context isn't entirely clear to me)
Misc Notes:
- SSE doesn't need data source info, only the ref. I think only data source plugins and the editing of data sources need the full info.
A Possible Architecture:
- Service to Authorize Datasource Ref to Users (To be called at borders (e.g. APIs or exported services))
- Another service to Translate Datasource Ref -> Datasource Info (to be called by internal services)
- With this, things like recorded queries would use the Datasource Ref authorization service. SSE also only sees the Datasource Ref but isn't responsible for authorization. Data sources get the info (I imagine an execute-queries call could use the ref to fetch the data source info before sending the request to a data source).
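
The two-service split above could be sketched roughly as follows. This is a sketch under assumptions, not an existing Grafana API: the `Authorizer` and `Resolver` interfaces, the in-memory `staticAuthorizer` / `staticResolver` stand-ins, and `HandleQuery` / `execute` are all hypothetical names. A real version would likely also pass a `context.Context`; it is left out to keep the example short.

```go
package main

import (
	"errors"
	"fmt"
)

type DatasourceRef struct{ UID, Type string }

type DatasourceInfo struct {
	Ref DatasourceRef
	URL string
}

// Authorizer is called at borders (APIs or exported services).
type Authorizer interface {
	Authorize(user string, ref DatasourceRef) error
}

// Resolver translates Ref -> Info and is called by internal services only.
type Resolver interface {
	Resolve(ref DatasourceRef) (DatasourceInfo, error)
}

var ErrForbidden = errors.New("user is not authorized for this data source")

// staticAuthorizer maps user -> UID -> allowed (in-memory stand-in).
type staticAuthorizer map[string]map[string]bool

func (a staticAuthorizer) Authorize(user string, ref DatasourceRef) error {
	if a[user][ref.UID] { // indexing a missing user yields a nil map, which reads as false
		return nil
	}
	return ErrForbidden
}

// staticResolver maps UID -> Info (in-memory stand-in).
type staticResolver map[string]DatasourceInfo

func (r staticResolver) Resolve(ref DatasourceRef) (DatasourceInfo, error) {
	info, ok := r[ref.UID]
	if !ok {
		return DatasourceInfo{}, fmt.Errorf("data source %q not found", ref.UID)
	}
	return info, nil
}

// HandleQuery is the border: it authorizes the Ref, then hands off.
func HandleQuery(authz Authorizer, res Resolver, user string, ref DatasourceRef) (string, error) {
	if err := authz.Authorize(user, ref); err != nil {
		return "", err
	}
	return execute(res, ref)
}

// execute is internal: it never sees the user, only the Ref, and
// fetches the Info just before the request would be sent.
func execute(res Resolver, ref DatasourceRef) (string, error) {
	info, err := res.Resolve(ref)
	if err != nil {
		return "", err
	}
	return "would send request to " + info.URL, nil
}

func main() {
	authz := staticAuthorizer{"alice": {"abc123": true}}
	res := staticResolver{"abc123": {Ref: DatasourceRef{UID: "abc123"}, URL: "http://localhost:9090"}}
	fmt.Println(HandleQuery(authz, res, "alice", DatasourceRef{UID: "abc123"}))
	fmt.Println(HandleQuery(authz, res, "bob", DatasourceRef{UID: "abc123"}))
}
```

In this shape, something like SSE would sit between `HandleQuery` and `execute` and pass the Ref through without ever holding the Info or its secrets.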

Or is it just how expressions work.

It is just how it works. This gets into my type stuff, but basically they are all converted to one format (Many: one frame per thing (time series or number)) so other operations can work with things in a consistent way. It doesn't have to be this way per se. In the past the code kept things as is, and used specifier data and methods to get things, but this ended up being a bit unwieldy. After the types work it is perhaps more feasible.

Yes, I know, and I tend to agree :) But it doesn't feel optimal that each consumer has to parse and understand each and every query when the request comes in, just for the sake of checking permissions, when the actual parsing and evaluation happens later.

This is one of the reasons I was thinking perhaps the DatasourceRef distinction is important. It is looping over every query, but only to get that DatasourceRef property which is a Grafana reserved property / known type. I guess this also has the underlying assumption that the border (API / user interaction) layers should check permissions.
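
As a rough illustration of that loop, the Go sketch below decodes only the reserved property from each query and leaves everything else opaque. The request shape (a `queries` array whose items carry a `datasource` object with `uid` and `type`) and the names `RefsFromRequest`, `queryRequest`, and `queryEnvelope` are assumptions for illustration.

```go
package main

import (
	"encoding/json"
	"fmt"
)

type DatasourceRef struct {
	UID  string `json:"uid"`
	Type string `json:"type"`
}

// queryEnvelope decodes only the reserved datasource property.
// Every other (plugin-specific) field in the query is ignored.
type queryEnvelope struct {
	Datasource DatasourceRef `json:"datasource"`
}

type queryRequest struct {
	Queries []queryEnvelope `json:"queries"`
}

// RefsFromRequest returns the distinct refs used by a request body,
// in first-seen order, so each one is authorized only once.
func RefsFromRequest(body []byte) ([]DatasourceRef, error) {
	var req queryRequest
	if err := json.Unmarshal(body, &req); err != nil {
		return nil, fmt.Errorf("decoding request: %w", err)
	}
	seen := map[string]bool{}
	var refs []DatasourceRef
	for i, q := range req.Queries {
		if q.Datasource.UID == "" {
			return nil, fmt.Errorf("query %d: missing datasource uid", i)
		}
		if !seen[q.Datasource.UID] {
			seen[q.Datasource.UID] = true
			refs = append(refs, q.Datasource)
		}
	}
	return refs, nil
}

func main() {
	body := []byte(`{"queries":[
		{"refId":"A","datasource":{"uid":"prom-1","type":"prometheus"},"expr":"up"},
		{"refId":"B","datasource":{"uid":"prom-1","type":"prometheus"},"expr":"rate(x[5m])"},
		{"refId":"C","datasource":{"uid":"loki-1","type":"loki"},"expr":"{job=\"a\"}"}
	]}`)
	refs, err := RefsFromRequest(body)
	fmt.Println(refs, err)
}
```

The border layer never has to understand `expr` or any other query-specific field; it only needs the list of refs to check permissions against.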

kylebrandt/notes.md

kylebrandt commented Nov 15, 2021

marefr commented Nov 17, 2021

kylebrandt commented Nov 17, 2021