Network Working Group P. Faltstrom Internet-Draft R. Austein Expires: April 11, 2005 IAB October 11, 2004 Design Choices When Expanding DNS draft-iab-dns-choices-00.txt Status of this Memo By submitting this Internet-Draft, I certify that any applicable patent or other IPR claims of which I am aware have been disclosed, and any of which I become aware will be disclosed, in accordance with RFC 3668. Internet-Drafts are working documents of the Internet Engineering Task Force (IETF), its areas, and its working groups. Note that other groups may also distribute working documents as Internet-Drafts. Internet-Drafts are draft documents valid for a maximum of six months and may be updated, replaced, or obsoleted by other documents at any time. It is inappropriate to use Internet-Drafts as reference material or to cite them other than as "work in progress." The list of current Internet-Drafts can be accessed at http://www.ietf.org/ietf/1id-abstracts.txt. The list of Internet-Draft Shadow Directories can be accessed at http://www.ietf.org/shadow.html. This Internet-Draft will expire on April 11, 2005. Copyright Notice Copyright (C) The Internet Society (2004). All Rights Reserved. Abstract This note discusses how to extend the DNS with new data for a new application. DNS extension discussion too often circulate around reuse of the TXT record. This document lists different mechanisms to accomplish the extension to DNS, and comes to the conclusion use of a new RR Type is the best solution. Faltstrom & Austein Expires April 11, 2005 [Page 1] Internet-Draft Design Choices When Expanding DNS October 2004 Table of Contents 1. Introduction . . . . . . . . . . . . . . . . . . . . . . . . . 3 2. Background . . . . . . . . . . . . . . . . . . . . . . . . . . 3 3. Extension mechanisms . . . . . . . . . . . . . . . . . . . . . 4 3.1 Place selectors inside the RDATA . . . . . . . . . . . . . 4 3.2 Add a prefix to the owner name . . . . . . . . . . . . . . 4 3.3 Add a suffix to the owner name . . . . . . . . . . . . . . 5 3.4 Add a new Class . . . . . . . . . . . . . . . . . . . . . 5 3.5 Add a new Resource Record Type . . . . . . . . . . . . . . 6 4. Conclusion (why adding a new RR type is the answer) . . . . . 7 5. Conclusion and Recommendation . . . . . . . . . . . . . . . . 9 6. IANA Considerations . . . . . . . . . . . . . . . . . . . . . 10 7. Security Considerations . . . . . . . . . . . . . . . . . . . 10 8. References . . . . . . . . . . . . . . . . . . . . . . . . . . 10 Authors' Addresses . . . . . . . . . . . . . . . . . . . . . . 11 Intellectual Property and Copyright Statements . . . . . . . . 12 Faltstrom & Austein Expires April 11, 2005 [Page 2] Internet-Draft Design Choices When Expanding DNS October 2004 1. Introduction The DNS stores multiple categories of data. The two most commonly used categories are infrastructure data for the DNS system itself (NS and SOA records) and data which have to do with mappings between domain names and IP addresses (A, AAAA and PTR). There are other categories as well, some of which are tied to specific applications like email (MX), while others are generic record types used to convey information for multiple protocols (SRV, NAPTR). When storing data in the DNS for a new application, the data are usually tied to a "normal" domain name, so the application can query for the data it wants, while minimizing the impact on existing applications. Historically, extension of DNS to store data for applications tied to a domain name has been done in different ways at different times. MX records were created as a new resource record type specifically designed to support electronic mail. SRV records are a generic type which use a prefixing scheme in combination with a base domain name. Records associated with ENUM use a suffixing scheme. NAPTR records add selection data inside the RDATA. It is clear the way of adding new data to the DNS has been inconsistent, and the purpose of this document is to attempt to clarify the implications of each of these methods, both for the applications that use them and for the rest of the DNS system. 2. Background See RFC 2929 [RFC2929] for a brief summary of DNS query structure. Readers interested in the full story should start with the base DNS specification in RFC 1035 [RFC1035], and continue with the various documents which update, clarify, and extend the base specification. When composing a query into the DNS system, the parameters actually used by the protocol are a triple: a DNS name, a DNS class, and a DNS record type. Every resource record (RR) matching a particular name, type and class is said to belong to the same resource record set (RRset), and the whole RRset is always returned to the client which queries for it. Splitting an RRset is a protocol violation, because it results in coherency problems with the DNS caching mechanism. Note however that this requirement has nothing to do with the MTU size and choice of UDP or TCP as the transport protocol. The whole RRset always has to be returned, and if it doesn't fit in the MTU in UDP, TCP has to be used to not break this RRset atomic requirement. Faltstrom & Austein Expires April 11, 2005 [Page 3] Internet-Draft Design Choices When Expanding DNS October 2004 3. Extension mechanisms The DNS protocol is intended to be extensible to support new kinds of data. This section examines the various ways in which this sort of extension can be accomplished. 3.1 Place selectors inside the RDATA For a given query name, one might choose to have a single RRset (all sharing the same name, type, and class) shared by multiple applications, and have the different applications use selectors within the RR data (RDATA) to determine which records are intended for which applications. This sort of selector mechanism is usually referred to "subtyping", because it is in effect creating an additional type subsystem within a single DNS RR type. Examples of subtyping include NAPTR RRs (see RFC 2916 [RFC2916]) and the original DNSSEC KEY RR type (RFC 2535 [RFC2535]) (before it was updated by RFC 3445 [RFC3445]). All DNS subtyping schemes share a common weakness: With subtyping schemes it is impossible for a client to query for just the data it wants. Instead, the client must fetch the entire RRset, then select the RRs in which it is interested. Furthermore, since DNSSEC signatures operate on complete RRsets, the entire RRset must be re-signed if any RR in it changes. As a result, each application that uses a subtyped RR incurs higher overhead than any of the applications would have incurred had they not been using a subtyping scheme. The fact the RRset is always passed around as an indivisible unit increases the risk the RRset will not fit in a UDP packet, which in turn increases the risk that the client will have to retry the query with TCP, which substantially increases the load on the name server. More precisely: Having one query fail over to TCP is not a big deal, but since the typical ratio of clients to servers in the DNS system is very high, having a substantial number of DNS messages fail over to TCP it will cause the relevant name servers to be overloaded by TCP overhead. The final result of using a subtyping scheme might be that the zone administrator has to choose which of the services tied to one domain name can actually be used, because not all of them can be announced at the same time. 3.2 Add a prefix to the owner name By adding an application-specific prefix to a domain name, we will get different name/class/type triples, and therefore different RRsets. The problem with adding prefixes has to do with wildcards, Faltstrom & Austein Expires April 11, 2005 [Page 4] Internet-Draft Design Choices When Expanding DNS October 2004 especially if one has records like *.example.com. IN MX 1 mail.example.com. and one wants records tied to those names. Suppose one creates the prefix "_mail". One would then have to say something like _mail.*.example.com. IN X-FOO A B C D but DNS wildcards only work with the "*" as the leftmost token in the domain name. Even when a specific prefix is chosen, the data will still have to be stored in some RR type. This RR type can either be a "kitchen-sink record" or a new RR type. This implies that some other mechanism has to be applied as well, with implications detailed in other parts of this note (see also draft-ietf-dnsext-wcard-clarify [wcardclarify] regarding use of wildcards and DNS). 3.3 Add a suffix to the owner name Adding a suffix to a domain name changes the name/class/type triple, and therefore the RRset. The query name can be set to exactly the data one wants, and the size of the RRset is minimized. The problem with adding a suffix is that it creates a parallel tree within the IN class. There will be no technical mechanism to ensure that the delegation for "example.com" and "example.com._bar" are made to the same organization. Furthermore, data associated with a single entity will now be stored in two different zones, such as "example.com" and "example.com._bar", which, depending on who controls "_bar", can create new synchronization and update authorization issues. Even when using a different name, the data will still have to be stored in some RR type. This RR type can either be a "kitchen-sink record" or a new RR type. This implies that some other mechanism has to be applied as well, with implications detailed in other parts of this note. In RFC 2163 [RFC2163] an infix token is inserted directly below the TLD, but the result is the same as adding a suffix to the owner name (and because of that creation of a new TLD). 3.4 Add a new Class DNS zones are class-specific, in the sense that all the records in that zone share the same class as the zone's SOA record, and the existence of a zone in one class does not guarantee the existence of the zone in any other class. In practice, only the IN class has ever Faltstrom & Austein Expires April 11, 2005 [Page 5] Internet-Draft Design Choices When Expanding DNS October 2004 seen widespread deployment, and the administrative overhead of deploying an additional class would almost certainly be prohibitive. Nevertheless, one could in theory use the DNS class mechanism to distinguish between different kinds of data. However, since the DNS delegation tree (represented by NS RRs) is itself tied to a specific class, attempting to resolve a query by crossing a class boundary may produce unexpected results, because there is no guarantee that the name servers for the zone in the new class will be the same as the name servers in the IN class. The MIT Hesiod system used a scheme like this for storing data in the HS class, but only on a very small scale (within a single institution), and with an administrative fiat requiring that the delegation trees for the IN and HS trees be identical. Even when using a different class, the data will still have to be stored in some RR type or another. This RR type can either be a "kitchen-sink record" or a new RR type. This implies that some other mechanism has to be applied as well, with implications detailed in other parts of this note. 3.5 Add a new Resource Record Type When adding a new Resource Record type to the system, entities in four different roles have to be able to handle the new type: 1. There must be a way to insert the new resource records into the zone of the Primary Master name server. For some server implementations, the user interface only accepts record types which it understands (perhaps so that the implementation can attempt to validate the data). Other implementations allow the zone administrator to enter an integer for the resource record type code and the RDATA in Base64 or hexadecimal encoding (or even as raw data). RFC 3597 [RFC3597] specifies a standard generic encoding for this purpose. 2. A slave authoritative name server must be able to do a zone transfer, receive the data from some other authoritative name server, and serve data from the zone even though the zone includes records of unknown types. Historically, some implementations have had problems parsing stored copies of the zone file after restarting, but those problems have not been seen for a few years. 3. A full service resolver will cache the records which are responses to queries. As mentioned in RFC 3597 [RFC3597],there are various pitfalls where a recursive name server might end up having problems. 4. The application must be able to get the record with a new resource record type. The application itself may understand the Faltstrom & Austein Expires April 11, 2005 [Page 6] Internet-Draft Design Choices When Expanding DNS October 2004 RDATA, but the resolver library might not. Support for a generic interface for retrieving arbitrary DNS RR types has been a requirement since 1989 (see RFC 1123 [RFC1123] Section 6.1.4.2). Some stub resolver library implementations neglect to provide this functionality and cannot handle unknown RR types, but implementation of a new stub resolver library is not particularly difficult, and open source libraries that already provide this functionality are available. 4. Conclusion (why adding a new RR type is the answer) By now, the astute reader will be wondering about how to use the issues presented so far. We will now attempt to clear up the reader's confusion by following the thought processes of a typical application designer who wishes to store stuff in the DNS, showing how such a designer almost inevitably hits upon the idea of just using TXT RR, and why this is a bad thing, and instead why a new RR type is to be allocated. A typical application designer is not interested in the DNS for its own sake, but rather as a distributed database in which application data can be stored. As a result, the designer of a new application is usually looking for the easiest way to add whatever new data the application needs to the DNS in a way that naturally associates the data with a DNS name. As explained in Section 3.4, using the DNS class system as an extension mechanism is not really an option, and in fact most users of the system don't even realize that the mechanism exists. As a practical matter, therefore any extension is likely to be within the IN class. Adding a new RR type is the technically correct answer from the DNS protocol standpoint (more on this below), doing so requires some DNS expertise, due to the issues listed in Section 3.5. As a result, this option is usually considered too hard. The application designer is thus left with the prospect of reusing some existing DNS type within the IN class, but when the designer looks at the existing types, almost all of them have well-defined semantics, none of which quite match the needs of the new application. This has not completely prevented proposals to reuse existing RR types in ways incompatible with their defined semantics, but it does tend to steer application designers away from this approach. RR Type 40 was registered for the SINK record type. This RR Type was discussed in the DNSIND working group of the IETF, and it was decided Faltstrom & Austein Expires April 11, 2005 [Page 7] Internet-Draft Design Choices When Expanding DNS October 2004 at the 46th IETF to not move the I-D forward to become an RFC. Reason was the risk for large RR sets and the ability for application creators to use the SINK RR Type instead of registering a new RR Type. Eliminating all of the above leaves the TXT RR type in the IN class. The TXT RDATA format is free form text, and there are no existing semantics to get in the way. Furthermore, the TXT RR can obviously just be used as a bucket in which to carry around data to be used by some higher level parser, perhaps in some human readable programming or markup language. Thus, for many applications, TXT RRs are the "obvious" choice. Unfortunately, this conclusion, while understandable, is also wrong, for several reasons. The first reason why TXT RRs are not well suited to such use is precisely the lack of defined semantics that make them so attractive. Arguably, the TXT RR is misnamed, and should have been called the Local Container record, because the lack of defined semantics means that a TXT RR means precisely what the data producer says it means. This is fine, so long as TXT RRs are being used by human beings or by private agreement between data producer and data consumer. However, it becomes a problem once one starts using them for standardized protocols in which there is no prior relationship between data producer and data consumer. Reason for this is that there is nothing to prevent collisions with some other incompatible use of TXT RRs. This is even worse than the general subtyping problem described in Section 3.1, because TXT RRs don't even have a standardized selector field in which to store the subtype. RFC1464 [RFC1464] tried, but it was not a success. At best a definition of a subtype is reduced to hoping that whatever scheme one has come up with will not accidently conflict with somebody else's subtyping scheme, and that it will not be possible to mis-parse one application's use of TXT RRs as data intended for a different application. Any attempt to come up with a standardized format within the TXT RR format would be at least fifteen years too late even if it were put into effect immediately. Using one of the naming modifications discussed in Section 3.2 and Section 3.3 would address the subtyping problem, but each of these approaches brings in new problems of its own. The prefix approach (such as SRV RRs use) does not work well with wildcards, which is a particular problem for mail-related applications, since MX RRs are probably the most common use of DNS wildcards. The suffix approach doesn't have wildcard issues, but, as noted previously, it does have synchronization and update authorization issues, since it works by creating a second subtree in a different part of the global DNS name space. The next reason why TXT RRs are not well suited to protocol use has Faltstrom & Austein Expires April 11, 2005 [Page 8] Internet-Draft Design Choices When Expanding DNS October 2004 to do with the limited data space available in a DNS message. As alluded to briefly in Section 3.1, typical DNS query traffic patterns involve a very large number of DNS clients sending queries to a relatively small number of DNS servers. Normal path MTU discovery schemes do little good here, because, from the server's perspective, there isn't enough repeat traffic from any one client for it to be worth retaining state. UDP-based DNS is an idempotent query, whereas TCP-based DNS requires the server to keep state (in the form of TCP connection state, usually in the server's kernel) and roughly triples the traffic load. Thus, there's a strong incentive to keep DNS messages short enough to fit in a UDP datagram, preferably a UDP datagram short enough not to require IP fragmentation. Subtyping schemes are therefore again problematic, because they produce larger RRsets than necessary, but verbose text encodings of data are also wasteful, since the data they hold can usually be represented more compactly in a resource record designed specifically to support the application's particular data needs. If the data that need to be carried are so large that there is no way to make them fit comfortably into the DNS regardless of encoding, it is probably better to move the data somewhere else, and just use the DNS as a pointer to the data, as with NAPTR. 5. Conclusion and Recommendation Given the problems detailed in Section 4, it is worth reexamining the oft-jumped-to conclusion that specifying a new RR type is hard. Historically, this was indeed the case, but recent surveys suggest that support for unknown RR types [RFC3597] is now widespread, and that lack of support for unknown types is mostly an issue for relatively old software that would probably need to be upgraded in any case as part of supporting a new application. One should also remember that deployed DNS software today should support DNSSEC, and software recent enough to do so will have higher chance of being able to also support RFC3597. Of all the issues detailed in Section 3.5, provisioning the data is in some respects the most difficult. The problem here is less the authoritative name servers themselves than the front-end systems used to enter (and perhaps validate) the data. Hand editing does not work well for maintenance of large zones, so some sort of tool is necessary, and the tool may not be tightly coupled to the name server implementation itself. Note, however, that this provisioning problem exists to some degree with any new form of data to be stored in the DNS, regardless of data format, RR type, or naming scheme. Adapting front-end systems to support a new RR type may be a bit more difficult than reusing an existing type, but this appears to be a minor difference in degree rather than a difference in kind. Faltstrom & Austein Expires April 11, 2005 [Page 9] Internet-Draft Design Choices When Expanding DNS October 2004 Given the various issues described in this note, we believe that: o there is no magic solution which allows a completely painless addition of new data to the DNS, but o on the whole, the best solution is still to use the DNS type mechanism designed for precisely this purpose, and o of all the alternate solutions, the "obvious" approach of using TXT RRs is almost certainly the worst. 6. IANA Considerations This document does not require any IANA actions. 7. Security Considerations DNS RRsets can be signed using DNSSEC. DNSSEC is almost certainly necessary for any application mechanism that stores authorization data in the DNS itself. DNSSEC signatures significantly increase the size of the messages transported, and because of this, the DNS message size issues discussed in Section 3.1 and Section 4 are more serious than they might at first appear. Adding new RR types (as discussed in Section 3.5) might conceivably trigger bugs and other bad behavior in software which is not compliant with RFC 3597 [RFC3597], but most such software is old enough and insecure enough that it should be updated for other reasons in any case. Basic API support for retrieving arbitrary RR types has been a requirement since 1989 (see RFC 1123 [RFC1123]). Any new protocol that proposes to use the DNS to store data used to make authorization decisions would be well advised not only to use DNSSEC but also to encourage upgrades to DNS server software recent enough not to be riddled with well-known exploitable bugs. Because of this, support for new RR Types will not be as hard as people might think at first. 8 References [RFC1035] Mockapetris, P., "Domain names - implementation and specification", STD 13, RFC 1035, November 1987. [RFC1123] Braden, R., "Requirements for Internet Hosts - Application and Support", STD 3, RFC 1123, October 1989. [RFC1464] Rosenbaum, R., "Using the Domain Name System To Store Arbitrary String Attributes", RFC 1464, May 1993. [RFC2163] Allocchio, C., "Using the Internet DNS to Distribute MIXER Conformant Global Address Mapping (MCGAM)", RFC 2163, Faltstrom & Austein Expires April 11, 2005 [Page 10] Internet-Draft Design Choices When Expanding DNS October 2004 January 1998. [RFC2535] Eastlake, D., "Domain Name System Security Extensions", RFC 2535, March 1999. [RFC2606] Eastlake, D. and A. Panitz, "Reserved Top Level DNS Names", BCP 32, RFC 2606, June 1999. [RFC2671] Vixie, P., "Extension Mechanisms for DNS (EDNS0)", RFC 2671, August 1999. [RFC2916] Faltstrom, P., "E.164 number and DNS", RFC 2916, September 2000. [RFC2929] Eastlake, D., Brunner-Williams, E. and B. Manning, "Domain Name System (DNS) IANA Considerations", BCP 42, RFC 2929, September 2000. [RFC3445] Massey, D. and S. Rose, "Limiting the Scope of the KEY Resource Record (RR)", RFC 3445, December 2002. [RFC3597] Gustafsson, A., "Handling of Unknown DNS Resource Record (RR) Types", RFC 3597, September 2003. [wcardclarify] Halley, B. and E. Lewis, "draft-ietf-dnsext-wcard-clarify-02.txt, Clarifying the Role of Wild Card Domains in the Domain Name System, work in progress", September 2003. Authors' Addresses Patrik Faltstrom IAB EMail: paf@cisco.com Rob Austein IAB EMail: sra@isc.org Faltstrom & Austein Expires April 11, 2005 [Page 11] Internet-Draft Design Choices When Expanding DNS October 2004 Intellectual Property Statement The IETF takes no position regarding the validity or scope of any Intellectual Property Rights or other rights that might be claimed to pertain to the implementation or use of the technology described in this document or the extent to which any license under such rights might or might not be available; nor does it represent that it has made any independent effort to identify any such rights. Information on the procedures with respect to rights in RFC documents can be found in BCP 78 and BCP 79. Copies of IPR disclosures made to the IETF Secretariat and any assurances of licenses to be made available, or the result of an attempt made to obtain a general license or permission for the use of such proprietary rights by implementers or users of this specification can be obtained from the IETF on-line IPR repository at http://www.ietf.org/ipr. The IETF invites any interested party to bring to its attention any copyrights, patents or patent applications, or other proprietary rights that may cover technology that may be required to implement this standard. Please address the information to the IETF at ietf-ipr@ietf.org. Disclaimer of Validity This document and the information contained herein are provided on an "AS IS" basis and THE CONTRIBUTOR, THE ORGANIZATION HE/SHE REPRESENTS OR IS SPONSORED BY (IF ANY), THE INTERNET SOCIETY AND THE INTERNET ENGINEERING TASK FORCE DISCLAIM ALL WARRANTIES, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO ANY WARRANTY THAT THE USE OF THE INFORMATION HEREIN WILL NOT INFRINGE ANY RIGHTS OR ANY IMPLIED WARRANTIES OF MERCHANTABILITY OR FITNESS FOR A PARTICULAR PURPOSE. Copyright Statement Copyright (C) The Internet Society (2004). This document is subject to the rights, licenses and restrictions contained in BCP 78, and except as set forth therein, the authors retain all their rights. Acknowledgment Funding for the RFC Editor function is currently provided by the Internet Society. Faltstrom & Austein Expires April 11, 2005 [Page 12]