Issue #1: GSOC: design discussion meeting - fedora-iot

fedora-iot

#1 GSOC: design discussion meeting

Closed: fixed 4 years ago by pbrobinson. Opened 5 years ago by dustymabe.

For the 2018 IOT/Atomic GSOC idea we have @lorbus who is the student doing most of the work. We also have @pbrobinson @jlebon and myself helping to coordinate and mentor.

We met last week to go over the high level design of the project. We scratched notes on an etherpad (I'll persist those notes in a text file in this issue as well).

Some high level things that came out of the discussion:

#2 We need a generic health check framework for doing boot up health checks.

This needs to be generic enough to allow OS admins to bring your own boot health check. It needs to either be a daemon or possibly re-use systemd and systemd units to achieve the same goal. We should also provide some sane default boot health checks. rpm-ostree needs some way to interface and discover failed boot health checks.

#3 Ability for bootloader to choose old deployment on failures

Lots of possibilities discussed here. We can use hardware watchdog features and possibly set "boot once" in GRUB/UEFI features to achieve this goal.

#4 Added logic to update system (ostree/rpm-ostree)

If a system failed to boot let's not blindly re-apply the failed update and try again (infinite looping on failure). Showing boot health checks in status output is a +.

We'll break each of these out into subtickets and continue on with specific discussion in each of those.

dustymabe commented 5 years ago

Added backup etherpad text file:

Metadata Update from @pbrobinson:
- Issue tagged with: GSoC

5 years ago

dustymabe commented 5 years ago

had our second discussion today: here is what came of it: https://pagure.io/fedora-iot/issue/2#comment-511940

dustymabe commented 5 years ago

had our third discussion today: action items currently:

@dustymabe:

figure out what GSOC first evaluations mean/require (june 11-15)
review greenboot, contribute feedback/ideas

@lorbus :

keep working on greenboot
investigate greenboot "ease of use" options
investigate more options related to #3

@jlebon:

review greenboot, contribute feedback/ideas

@pbrobinson:

review options for #3, offer suggestions
review greenboot, contribute feedback/ideas

jlebon commented 5 years ago

OK, played around with greenboot for a bit. (I really like the name BTW!)

I think I understand why you went with a service instead of a target -- though OTOH, I really like the semantics of targets for our use case. systemd has these nice well-known targets which serve as interfaces for other services to hook into. One I recently came across was: the nss-user-lookup.target can be used to separate services that provide NSS lookup services (like SSSD) from those that require it (like systemd-logind). Similarly, a greenboot.target can be the interface other units plug into. (E.g. both for running things before the target, like the various tests, but also after, e.g. telling the mothership we're ready).

Here's a strawman:

greenboot.target with a WantedBy on multi-user.target
test services can put a RequiredBy/WantedBy on greenboot.target
and then a greenboot.service with After=greenboot.target to run whatever we want if we had a successful boot or not? (I.e. use systemd API to get the status of greenboot.target and e.g. update the bootloader config, stop the watchdog, etc...)

lorbus commented 5 years ago

@jlebon: Thanks for the feedback! I'm seeing the usecase for targets now more and more, too and had a similar idea of what to use greenboot.target for (see my comment in issue #3)

I'll incorporate this in the next iteration of greenboot!

Edited 5 years ago by lorbus

dustymabe commented 5 years ago

migrated your comments to https://pagure.io/fedora-iot/issue/2 - let's keep the discussion over there :)

lorbus commented 5 years ago

Should we have another meeting to wrap things up? I think @pbrobinson said he wasn't available this Friday, so maybe next week?

Metadata Update from @pbrobinson:
- Issue close_status updated to: fixed
- Issue status updated to: Closed (was: Open)

4 years ago

Metadata

Assignee

None

Tags

Blocking

None

Depending on

None

Attachments 1

etherpad.txt

Attached 5 years ago View Comment

fedora-iot

Source Code

Documentation

#1 GSOC: design discussion meeting Closed: fixed 4 years ago by pbrobinson. Opened 5 years ago by dustymabe.

Metadata

GSoC

Attachments 1

#1 GSOC: design discussion meeting

Closed: fixed 4 years ago by pbrobinson. Opened 5 years ago by dustymabe.