Add a position paper about the goals of Luau (#55)

Added a position paper on Luau for submission to HATRA 2021 Co-authored-by: Lily Brown <lbrown@roblox.com> Co-authored-by: Andy Friesen <afriesen@roblox.com>
2021-07-20 13:41:04 -05:00 · 2021-07-20 13:41:04 -05:00 · 7c7eb800f5
parent 01d9f8daf9
commit 7c7eb800f5
7 changed files with 507 additions and 0 deletions
--- a/papers/.gitignore
+++ b/papers/.gitignore
@ -0,0 +1,9 @@
+*.aux
+*.bbl
+*.blg
+*.dvi
+*.fdb_latexmk
+*.fls
+*.log
+*.out
+*.xcp
--- a/papers/hatra21/README.md
+++ b/papers/hatra21/README.md
@ -0,0 +1,43 @@
+# HATRA 21 position paper
+
+A position paper on Luau for [Human Aspects of Types and Reasoning Assistants](https://2021.splashcon.org/home/hatra-2021) (HATRA) 2021.
+
+## Installing latexmk
+
+First install  basictex
+```
+sudo brew install basictex
+```
+
+Then install the dependencies for the paper (sigh, by hand):
+
+```
+sudo tlmgr update --all
+sudo tlmgr install acmart
+sudo tlmgr install iftex
+sudo tlmgr install xstring
+sudo tlmgr install environ
+sudo tlmgr install totpages
+sudo tlmgr install trimspaces
+sudo tlmgr install manyfoot
+sudo tlmgr install ncctools
+sudo tlmgr install comment
+sudo tlmgr install balance
+sudo tlmgr install preprint
+sudo tlmgr install libertine
+sudo tlmgr install inconsolata
+sudo tlmgr install newtx
+sudo tlmgr install latexmk
+```
+
+## Building the paper
+
+To build the paper:
+```
+latexmk --pdf hatra21
+```
+
+To run latexmk in watching mode (where it rebuilds the PDF on each change):
+```
+latexmk --pdf --pvc hatra21
+```
--- a/papers/hatra21/bibliography.bib
+++ b/papers/hatra21/bibliography.bib
@ -0,0 +1,115 @@
+@Misc{Roblox,
+  author =    {Roblox},
+  title =     {What is {Roblox}},
+  year = 2021,
+  url = {https://corp.roblox.com},
+}
+
+@Misc{Luau,
+  author =    {Roblox},
+  title =        {The {Luau} Programming Language},
+  year = 2021,
+  url = {https://luau-lang.org},
+}
+
+@Misc{Lua,
+  author =    {Lua.org and {PUC}-Rio},
+  title =        {The {Lua} Programming Language},
+  year = 2021,
+  url = {https://lua.org},
+}
+
+@Misc{AllEducators,
+  author =       {Roblox},
+  title =        {Roblox Education: All Educators},
+  year =         {2021},
+  url =          {https://education.roblox.com/en-us/educators},
+}
+
+@Misc{RobloxDevelopers,
+  author =       {Roblox},
+  title =        {Roblox Developers Expected to Earn Over \$250 Million in 2020; Platform Now Has Over 150 Million Monthly Active Users
+},
+  year =         {2020},
+  url =          {https://corp.roblox.com/2020/07/roblox-developers-expected-earn-250-million-2020-platform-now-150-million-monthly-active-users/},
+}
+
+@Book{TAPL,
+  author =       {Benjamin C. Pierce},
+  title =        {Types and Programming Languages},
+  publisher =    {{MIT} Press},
+  year =         {2002},
+  isbn =         {0-262-16209-1},
+}
+
+@Book{TDDIdris,
+  author =       {Edwin Brady},
+  title =        {Type-Driven Development with {Idris}},
+  publisher =    {Manning},
+  year =         {2017},
+  isbn =         {9781617293023},
+}
+
+@PhdThesis{TopQuality,
+  author =       {Bastiaan J. Heeren},
+  title =        {Top Quality Type Error Messages},
+  school =       {U. Utrecht},
+  year =         {2005},
+}
+
+@PhdThesis{RepairingTypeErrors,
+  author =       {Bruce J. McAdam},
+  title =        {Repairing Type Errors in Functional Programs},
+  school =       {U. Edinburgh},
+  year =         {2002},
+}
+
+@InProceedings{GradualTyping,
+  author =       {Jeremy G. Siek and Walid Taha},
+  title =        {Gradual Typing for Functional Languages},
+  booktitle =    {Proc. Scheme and Functional Programming Workshop},
+  year =         {2006},
+  pages =        {81-92},
+}
+
+@InProceedings{WellTyped,
+  author =       {Philip Wadler and Robert B. Findler},
+  title =        {Well-typed Programs Can’t be Blamed},
+  booktitle =    {Proc. European Symp. Programming},
+  year =         {2009},
+  pages =        {1-16},
+}
+
+@InProceedings{Contracts,
+  author =       {Robert B. Findler and Matthias Felleisen},
+  title =        {Contracts for Higher-order Functions},
+  booktitle =    {Proc. Int. Conf. Functional Programming},
+  year =         {2002},
+  pages =        {48-59},
+}
+
+@inproceedings{SuccessTyping,
+  author = {Lindahl, Tobias and Sagonas, Konstantinos},
+  title = {Practical Type Inference Based on Success Typings},
+  year = {2006},
+  booktitle = {Proc. Int. Conf. Principles and Practice of Declarative Programming},
+  pages = {167–178},
+}
+
+@InProceedings{IncorrectnessLogic,
+  author = {O'Hearn, Peter W.},
+  title = {Incorrectness Logic},
+  year = {2020},
+  booktitle = {Proc. Symp. Principles of Programming Languages},
+  articleno = {10},
+  pages = {1-32},
+}
+
+@Misc{HowToDrawAnOwl,
+  author =       {Know Your Meme},
+  title =        {How To Draw An Owl},
+  year =         {2010},
+  url =    {https://knowyourmeme.com/memes/how-to-draw-an-owl},
+}
+
+
--- a/papers/hatra21/hatra21.pdf
+++ b/papers/hatra21/hatra21.pdf
--- a/papers/hatra21/hatra21.tex
+++ b/papers/hatra21/hatra21.tex
@ -0,0 +1,340 @@
+\documentclass[acmsmall]{acmart}
+
+\setcopyright{rightsretained}
+\copyrightyear{2021}
+\acmYear{2021}
+\acmConference[HATRA '21]{Human Aspects of Types and Reasoning Assistants}{October 2021}{Chicago, IL}
+\acmBooktitle{HATRA '21: Human Aspects of Types and Reasoning Assistants}
+
+\newcommand{\squnder}[1]{\color{red}\underline{{\color{black}#1}}\color{black}}
+\newcommand{\infer}[2]{\frac{\textstyle#1}{\textstyle#2}}
+\newcommand{\erase}{\mathrm{erase}}
+\newcommand{\evCtx}{\mathcal{E}}
+\newcommand{\NIL}{\mathsf{nil}}
+\newcommand{\TRUE}{\mathsf{true}}
+\newcommand{\FALSE}{\mathsf{false}}
+\newcommand{\NUMBER}{\mathsf{number}}
+\newcommand{\ERROR}{\mathsf{error}}
+\newcommand{\IF}{\mathsf{if}\,}
+\newcommand{\THEN}{\,\mathsf{then}\,}
+\newcommand{\ELSE}{\,\mathsf{else}\,}
+\newcommand{\END}{\,\mathsf{end}}
+\newcommand{\FUNCTION}{\mathsf{function}\,}
+\newcommand{\RETURN}{\mathsf{return}\,}
+
+\begin{document}
+
+\title{Position Paper: Some Goals of the Luau Type System}
+
+\author{Lily Brown}
+\author{Andy Friesen}
+\author{Alan Jeffrey}
+\affiliation{
+  \institution{Roblox}
+  \city{San Mateo}
+  \state{CA}
+  \country{USA}
+}
+
+\begin{abstract}
+  Luau is the scripting language used in creating Roblox experiences.
+  It is a statically-typed language based on the dynamically-typed Lua language,
+  with type inference. These types are used in the
+  IDE, for example when providing autocomplete suggestions. In this
+  paper, we describe some of the goals of the Luau type system,
+  focusing on where the goals are different from those of other type systems.
+\end{abstract}
+
+\maketitle
+
+\section{Introduction}
+
+The Roblox~\cite{Roblox} platform allows anyone to create shared,
+immersive, 3D experiences.  At the time of writing, there are
+approximately 20~million experiences available on Roblox, created
+by 8~million developers.  Roblox creators are often young, for
+example there are over 200~Roblox kids' coding camps in 65~countries
+listed at~\cite{AllEducators}.
+The Luau programming language~\cite{Luau} is the scripting language
+used by developers of Roblox experiences. Luau is derived from the Lua
+programming language~\cite{Lua}, with additional capabilities,
+including a type inference engine.
+
+This paper will discuss some of the goals of the Luau type system,
+focusing on where the goals are different from those of other type systems.
+
+\section{Human Aspects}
+\subsection{Heterogeneous developer community}
+
+Quoting a Roblox 2020 report \cite{RobloxDevelopers}:
+\begin{itemize}
+\item Adopt Me! now has over 10 billion plays and surpassed 1.6 million concurrent users in game earlier this year.
+\item Piggy, launched in January 2020, has close to 5 billion visits in just over six months.
+\item There are now 345,000 developers on the platform who are monetizing their games.
+\end{itemize}
+This demonstrates how heterogeneous the Roblox developer community is:
+developers of experiences with billions of plays are on the same
+platform as children first learning to code. Moreover, \emph{both of
+these groups are important}, as the professional development studios
+bring high-quality experiences to the platform, and the beginning creators
+contribute to the energetic creative community, and will form the next generation of developers.
+
+\subsection{Goal-driven learning}
+
+All developers are goal-driven, but this is especially true for
+learners. A learner will download Roblox Studio (the IDE) with an
+experience in mind, often designing an obstacle course (an ``obby'')
+to play in with their friends.
+
+The user experience of developing a Roblox experience is primarily a
+3D interactive one, seen in Fig.~\ref{fig:studio}(a). The user designs
+and deploys 3D assets such as terrain, parts and joints, and provides
+them with physics attributes such as mass and orientation. The user
+can interact with the experience in Studio, and deploy it to a Roblox
+server so anyone with the Roblox app can play it. Physics, rendering
+and multiplayer are all immediately accessible to all creators.
+
+\begin{figure}
+\includegraphics[width=0.48\textwidth]{studio-mow.png}
+\includegraphics[width=0.48\textwidth]{studio-script-editor.png}
+\caption{Roblox Studio's 3D environment editor (a), and script editor (b)}
+\label{fig:studio}
+\end{figure}
+
+At some point during experience design, the user of Studio has a need
+which can't be met by the physics engine alone. ``The stairs should
+light up when a player walks on them'' or ``a firework is set off
+every few seconds.'' At this point they will discover the script
+editor, seen in Fig.~\ref{fig:studio}(b).
+
+This onboarding experience is different from many initial exposures to
+programming, in that by the time the user first opens the script
+editor, they have already built much of their creation, and have a
+very specific concrete aim.  It suggests a Luau goal for helping the
+majority of creators: \emph{support learning how to perform specific
+tasks} (for example through autocomplete suggestions).
+
+\subsection{Type-driven development}
+
+Professional development studios are also goal-directed (though the
+goals may be more abstract, such as ``decrease user churn'' or
+``improve frame rate'') but have additional needs:
+\begin{itemize}
+
+\item \emph{Code planning}:
+  code spends much of its development time in an incomplete state,
+  with holes that will be filled in later.
+
+\item \emph{Code refactoring}:
+  experiences evolve over time, and it easy for changes to
+  break previously-held invariants.
+
+\item \emph{Defect detection}:
+  code has errors, and detecting these at runtime (for example by crash telemetry)
+  can be expensive and recovery can be time-consuming.
+  
+\end{itemize}
+Detecting defects ahead-of-time is a traditional goal of type systems,
+resulting in an array of techniques for establishing safety results,
+surveyed for example in~\cite{TAPL}. Supporting code planning and
+refactoring are some of the goals of \emph{type-driven
+development}~\cite{TDDIdris} under the slogan ``type, define,
+refine''.  For example. a common use of type-driven development is to
+rename a property, which is achieved by changing the name in one place,
+and then fixing the resulting type errors---once the type system stops
+reporting errors, the refactoring is complete.
+
+To help support the transition from novice to experienced developer,
+types are introduced gradually, through API documentation and type discovery.
+Type inference provides many of the benefits of type-driven development
+even to creators who are not explicitly providing types.
+
+\section{Types}
+\subsection{Infallible types}
+
+Goal: \emph{support type-driven tools for all programs}.
+
+Programs spend much of their time under development in an incomplete state, even if the final artifact
+is well-typed. Tools should support this by providing type information even for ill-typed programs.
+An analogy is infallible parsers, which perform error recovery and provide an AST for all input texts.
+
+Program analysis can still flag type errors, for example with red
+squiggly underlining. Formalizing this, rather than a judgment
+$\Gamma\vdash M:T$, for an input term $M$, there is a judgment
+$\Gamma \vdash M \Rightarrow N : T$ where $N$ is an output term
+where some subterms are \emph{flagged} as having type errors, written $\squnder{N}$. Write $\erase(N)$
+for the result of erasing flaggings: $\erase(\squnder{N}) = \erase(N)$.
+
+%% For example the usual
+%% type rules for field access becomes:
+%% \[
+%%   \infer{
+%%     \Gamma \vdash M \Rightarrow M' : T
+%%   }{
+%%     \Gamma \vdash M.\ell \Rightarrow M'.\ell : U
+%%   }
+%%   [
+%%     T = \{ \overline{\ell:U} \} \mbox{ and } (\ell:U) \in (\overline{\ell:U})
+%%   ]
+%% \]
+%% but there is also a rule for unsuccessful field access:  
+%% \[
+%%   \infer{
+%%     \Gamma \vdash M \Rightarrow M' : T
+%%   }{
+%%     \Gamma \vdash M.\ell \Rightarrow \squnder{M'.\ell} : U
+%%   }
+%%   [
+%%     T = \{ \overline{\ell:U} \} \mbox{ implies } \ell \not\in \overline{\ell}
+%%   ]
+%% \]
+%% In this type rule, $U$ is unconstrained.
+
+The goal of infallible types is that every term can be typed:
+\begin{itemize}
+\item \emph{Typability}: for every $M$ and $\Gamma$,
+  there are $N$ and $T$ such that $\Gamma \vdash M \Rightarrow N : T$.
+\item \emph{Erasure}: if $\Gamma \vdash M \Rightarrow N : T$
+  then $\erase(M) = \erase(N)$ 
+\end{itemize}
+Some issues raised by infallible types:
+\begin{itemize}
+\item Which heuristics should be used to provide types for flagged programs? For example, could one
+  use minimal edit distance to correct for spelling mistakes in field names?
+\item How can we avoid cascading type errors, where a developer is
+  faced with type errors that are artifacts of the heuristics rather
+  than genuine errors?
+\item How can the goals of an infallible type system be formalized?
+\end{itemize}
+\emph{Related work}:
+there is a large body of work on type error reporting
+(see, for example, the survey in~\cite[Ch.~3]{TopQuality})
+and on type-directed program repair
+(see, for example, the survey in~\cite[Ch.~3]{RepairingTypeErrors}),
+but not type repair, or on
+the semantics of programs with type errors. Many compilers perform
+error recovery during typechecking, but do not provide a semantics
+for programs with type errors.
+
+\subsection{Strict types}
+
+Goal: \emph{no false negatives.}
+
+For developers who are interested in defect detection, Luau provides a \emph{strict mode},
+which acts much like a traditional, sound, type system. This has the goal of ``no false negatives'' that is any
+run-time error is flagged. This is formalized using:
+\begin{itemize}
+\item \emph{Operational semantics}: a reduction judgment $M \rightarrow N$ on terms.
+\item \emph{Values}: a subset of terms representing a successfully completed evaluation.
+\end{itemize}
+Error states at runtime are represented as stuck states (terms that are not
+values but cannot reduce), and showing that no well-typed program is
+stuck. This is not true if typing is infallible, but can fairly
+straightforwardly be adapted. We extend the operational semantics to flagged terms,
+where $M \rightarrow M'$ implies $\squnder{M} \rightarrow \squnder{M'}$, and
+for any value $V$ we have $\squnder{V} \rightarrow V$, then show:
+\begin{itemize}
+\item \emph{Progress}: if ${} \vdash M \Rightarrow N : T$, then either $N \rightarrow N'$ or $N$ is a value or $N$ has a flagged subterm.
+\item \emph{Preservation}: if ${} \vdash M \Rightarrow N : T$ and $N \rightarrow N'$ then  $M \rightarrow^*M'$ and ${} \vdash M' \Rightarrow N' : T$.
+\end{itemize}
+Some issues raised by infallible types:
+\begin{itemize}
+\item How should the judgments and their metatheory be set up?
+\item How should type inference and generic functions be handled?
+\item Is the operational semantics of flagged values
+  ($\squnder{V} \rightarrow V$) the right one?
+\item Will higher-order code require wrappers on functions? 
+\end{itemize}
+\emph{Related work}: gradual typing and blame analysis, e.g.~\cite{GradualTyping,WellTyped,Contracts}
+
+\subsection{Nonstrict types}
+
+Goal: \emph{no false positives.}
+
+For developers who are not interested in defect detection, type-driven
+tools and techniques such as autocomplete, API documentation
+and support for refactoring can still be useful.
+For such developers, Luau provides a
+\emph{nonstrict mode}, which we hope will eventually be useful for all
+developers. This does \emph{not} aim for soundness, but instead has
+the goal of ``no false positives``, in the sense that any flagged code
+is guaranteed to produce a runtime error when executed.
+
+On the face of it, this is undecidable, since a program such as
+$(\IF f() \THEN \ERROR \END)$ will produce a runtime error when $f()$ is
+$\TRUE$, but we can aim for a weaker property, that all flagged code
+is either dead code or will produce an error. Either of these is a
+defect, so deserves flagging, even if the tool does not know
+which reason applies.
+
+We can formalize this by defining an \emph{evaluation context}
+$\evCtx[\bullet]$, and saying $M$ is \emph{incorrectly flagged}
+if it is of the form $\evCtx[\squnder{V}]$. We can then define:
+\begin{itemize}
+\item \emph{Correct flagging}: if ${} \vdash M \Rightarrow N : T$
+  then $N$ is correctly flagged.
+\end{itemize}
+Some issues raised by nonstrict types:
+\begin{itemize}
+
+\item Under this definition, any function that will terminate is unflagged, so
+  flagging will often move from function definitions to call sites.
+
+\item This definition will not allow an unchecked use of an optional value
+  to be flagged, for example if $f() : \NUMBER?$ (meaning $f$ may optionally return a number)
+  then a strict type system can flag $1 + f()$ but a nonstrict one cannot.
+
+\item Property update of tables in languages like Luau always succeeds
+  (the property is inserted if it did not exist), and so functions which
+  update properties cannot be flagged.
+
+\item Does nonstrict typing require whole program analysis,
+  to find all the possible types a property might be updated with?
+
+\item The natural formulation of function types in a nonstrict setting
+  is that of~\cite{???}: if $f: T \rightarrow U$ and $f(V) \rightarrow^* W$
+  then $V:T$ and $W:U$. This formulation is \emph{covariant} in $T$,
+  not \emph{contavariant}; what impact does this have?
+  
+\end{itemize}
+\emph{Related work}: success types~\cite{SuccessTyping} and incorrectness logic~\cite{IncorrectnessLogic}.
+
+\subsection{Mixing types}
+
+Goal: \emph{support mixed strict/nonstrict development}.
+
+Like every active software community, Roblox developers share code
+with one another constantly.  First- and third-party developers alike
+frequently share entire software packages written in Luau.  To add to
+this, many Roblox games are authored not by just one developer, but a
+team.
+It is therefore crucial that we offer first-class support for mixing
+code written in strict and nonstrict modes.
+
+Some issues raised by mixed-mode types:
+\begin{itemize}
+
+\item How much feedback can we offer for a nonstrict script that is
+  importing strict-mode code?
+
+\item In strict mode, how do we talk about values and types that are
+  drawn from nonstrict code?
+
+\item How can we combine the goals of strict and nonstrict types?
+
+\item Can we have strict and non-strict mode infer the same types,
+  only with different flagging?
+
+\end{itemize}
+\emph{Related work}: this appears to be an under-explored area.
+
+\section{Conclusions}
+
+In this paper, we have presented some of the goals of the Luau type
+system, and how they map to the needs of the Roblox creator
+community. We have sketched what a solution might look like, all that
+remains is to draw the owl~\cite{HowToDrawAnOwl}.
+
+\bibliographystyle{ACM-Reference-Format} \bibliography{bibliography}
+
+\end{document}
--- a/papers/hatra21/studio-mow.png
+++ b/papers/hatra21/studio-mow.png
--- a/papers/hatra21/studio-script-editor.png
+++ b/papers/hatra21/studio-script-editor.png