Document why we don't use maps in the API in api-conventions.md #2004

bgrant0607 · 2014-10-27T17:54:23Z

Forked from #1980.

The API has a number of lists containing names embedded in objects, such as Volumes, Containers, Ports, Env, and VolumeMounts. Both configuration and field references (e.g., in filter expressions, events) are uglier and/or more verbose when using lists rather than maps. This puts us at a disadvantage compared to other systems with more elegant API and/or configuration schemas (e.g., Fig).

Automatically translating maps into lists of named objects appears to be hard. In JSON and YAML, structures and maps cannot be distinguished without a schema. It looks like we'd need either duplicate fields, duplicate schemas, or a custom parser in order to support both formats in the same API version. Duplicate schemas have proven hard to maintain, both in Kubernetes and internally. I don't think we want to maintain parallel API versions forever, either. The go-yaml parser is around 9k lines of code -- a custom parser is not something we want to own, IMO.

The specific proposal here is to:

change these lists to maps
make these name fields optional and auto-populate them from map keys in apiserver, so that names are available in subobjects even without the maps to which they belong

Port name is currently optional. It would effectively be required. A convention of "p" would be straightforward for users/tools, however, such as in the case of ports auto-populated from Docker images (e.g., by podex).

This would be a breaking change. We could do it in v1beta3.

The names of top-level objects, in ObjectMeta in v1beta3, would not be changed. Those can be auto-populated in v1beta3 by clients in a straightforward manner.

/cc @smarterclayton @erictune @proppy @thockin @jbeda

jbeda · 2014-10-27T18:13:06Z

We've gone back an forth on this in the past in various config discussions. Here is the last time I remember on k8s: #853 (comment)

The crux of this problem is that it isn't clear to the user what "left hand side strings" are "magic keywords" in the config system/API vs. which are user data.

Hoisting the example from that other bug -- compare these two:

Example A:

ports:
  - name: www
    hostPort: 80
    containerPort: 80
    protocol: tcp

Example B:

ports:
  www:
    hostPort: 80
    containerPort: 80
    protocol: tcp

While B is obviously shorter than A, I think it is more confusing for the novice user. When copy/pasting examples or reading unfamiliar configs, the novice user won't know what www is. Is this a magic value that they aren't supposed to change (like ports) or is it an input/naming thing that they should change?

If we follow your suggestion and a name into the sub object It'll be even more confusing:

ports:
  www:
    name: www
    hostPort: 80
    containerPort: 80
    protocol: tcp

Questions users'll be asking: why is www there twice? What happens if I change one but not the other?

bgrant0607 · 2014-10-27T18:51:29Z

@jbeda Thanks for pointing me at the previous discussion. I remembered that it had come up before but couldn't remember where.

I see your point regarding distinguishing schema keys and user names. However, we don't have enough user data to be able to really know which they'd prefer.

Also, maps are used in competing solutions, such as Fig:

https://gist.github.com/proppy/8e714bf41b6b0978ab0e#fig2kube

Admittedly, Fig does this only for their top-level objects -- containers in their case, and I do the same in my configuration generators. Although, they do accept both maps and lists for environment variables. I'll look at how they do this (note: Fig is python, not Go).

Another alternative could be to make all subobject names optional. There are no subobject names in Docker's API, nor in Fig. However, one reason we have names for subobjects is that we've added a parent object around containers and volumes.

With respect to the example above: Mainly subobject names would be needed internally rather than in serialized form. We could potentially omit them from serialized form by specifying "-" so that the fields are ignored during marshaling/unmarshaling.

jbeda · 2014-10-27T19:46:47Z

We should work to reduce the amount of boiler plate, for sure, but I'm not sure that the API is the place to do it. If we want to have a more concise config language/schema -- go for it. There is room in the world to allow for different trade offs.

There are other things we want to avoid here -- significantly, each key should have one and only one form for what it accepts -- this let's us have a strongly typed schema instead of forcing us to interpret the yaml parse tree with custom code.

bgrant0607 · 2014-10-27T21:00:03Z

Fig effectively has a custom parser:
https://github.com/docker/fig/blob/master/fig/service.py#L369

bgrant0607 · 2014-10-31T04:01:32Z

Abandoning this idea. Converting it to a doc bug to document the reason for the way the API is.

Documentation improvements. Fixes #2004, #2115, #2171.

bgrant0607 · 2015-02-26T20:11:42Z

@ghodss has pointed out that lists do not allow generic merging for configuration updates.

bgrant0607 · 2015-02-27T07:34:26Z

Reclosing in favor of #4889.

bgrant0607 added area/api Indicates an issue on api area. area/usability labels Oct 27, 2014

bgrant0607 added the area/app-lifecycle label Oct 30, 2014

bgrant0607 changed the title ~~Consider converting all subobject lists to maps in API~~ Document why we don't use maps in the API in api-conventions.md Oct 31, 2014

bgrant0607 added the kind/documentation Categorizes issue or PR as related to documentation. label Oct 31, 2014

bgrant0607 self-assigned this Nov 6, 2014

bgrant0607 closed this as completed in d5700ea Nov 17, 2014

brendandburns added a commit that referenced this issue Nov 17, 2014

Merge pull request #2423 from bgrant0607/docfix

6fa798c

Documentation improvements. Fixes #2004, #2115, #2171.

bgrant0607 mentioned this issue Dec 1, 2014

Add node status to API object. #2315

Merged

bgrant0607 reopened this Feb 26, 2015

bgrant0607 mentioned this issue Feb 27, 2015

api: return endpoints pod identifiers #4482

Merged

ghodss mentioned this issue Feb 27, 2015

API should differentiate between lists to merge and lists to replace in reconciliation #4889

Closed

bgrant0607 closed this as completed Feb 27, 2015

bgrant0607 mentioned this issue Mar 16, 2015

REST api - "env" section seems to be structured differently than other key value pair attributes #5490

Closed

aweiteka mentioned this issue May 4, 2015

Use list of apps instead of magic key naming projectatomic/nulecule#35

Closed

goern mentioned this issue Jun 7, 2016

Removing graph(s) in favour of app-name projectatomic/nulecule#210

Closed

dragoslav mentioned this issue Jun 19, 2016

Lists of named subobjects vs maps magneticio/vamp#684

Open

pmorie mentioned this issue Nov 20, 2016

Refactor status for API resources to conditions instead of states kubernetes-retired/service-catalog#41

Merged

killwing mentioned this issue May 29, 2018

fill MXJob structure qiniu-ava/mxnet-operator#3

Merged

This was referenced Sep 30, 2019

Implement a KafkaConnector CRD and Operator strimzi/strimzi-kafka-operator#2001

Closed

[Enhancement] Change the kafka-versions file into JSON/YAML format strimzi/strimzi-kafka-operator#2040

Closed

rajathagasthya mentioned this issue Feb 26, 2020

✨ add support for pointers as map values kubernetes-sigs/controller-tools#317

Merged

porridge mentioned this issue Mar 9, 2020

KEP-26: Reading parameter values from a file kudobuilder/kudo#1364

Merged

hasheddan mentioned this issue Aug 10, 2020

Package Manager refactor design doc crossplane/crossplane#1616

Merged

2 tasks

ceclinux mentioned this issue Feb 9, 2021

Rule based networkpolicystats antrea-io/antrea#1780

Merged

jerop mentioned this issue Aug 1, 2022

Add types and client for Resolution tektoncd/pipeline#5200

Merged

7 tasks

jerop mentioned this issue Oct 12, 2023

[TEP-0143] Concise Parameters and Results - proposed tektoncd/community#1071

Merged

youngnick mentioned this issue Feb 19, 2024

Defaults & Overrides (RFC 0009) Kuadrant/architecture#58

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Document why we don't use maps in the API in api-conventions.md #2004

Document why we don't use maps in the API in api-conventions.md #2004

bgrant0607 commented Oct 27, 2014

jbeda commented Oct 27, 2014

bgrant0607 commented Oct 27, 2014

jbeda commented Oct 27, 2014

bgrant0607 commented Oct 27, 2014

bgrant0607 commented Oct 31, 2014

bgrant0607 commented Feb 26, 2015

bgrant0607 commented Feb 27, 2015

Document why we don't use maps in the API in api-conventions.md #2004

Document why we don't use maps in the API in api-conventions.md #2004

Comments

bgrant0607 commented Oct 27, 2014

jbeda commented Oct 27, 2014

bgrant0607 commented Oct 27, 2014

jbeda commented Oct 27, 2014

bgrant0607 commented Oct 27, 2014

bgrant0607 commented Oct 31, 2014

bgrant0607 commented Feb 26, 2015

bgrant0607 commented Feb 27, 2015