Add a position paper about the goals of Luau (#55)

Added a position paper on Luau for submission to HATRA 2021

Co-authored-by: Lily Brown <lbrown@roblox.com>
Co-authored-by: Andy Friesen <afriesen@roblox.com>
This commit is contained in:
Alan Jeffrey 2021-07-20 13:41:04 -05:00 committed by GitHub
parent 01d9f8daf9
commit 7c7eb800f5
No known key found for this signature in database
GPG Key ID: 4AEE18F83AFDEB23
7 changed files with 507 additions and 0 deletions

9
papers/.gitignore vendored Normal file
View File

@ -0,0 +1,9 @@
*.aux
*.bbl
*.blg
*.dvi
*.fdb_latexmk
*.fls
*.log
*.out
*.xcp

43
papers/hatra21/README.md Normal file
View File

@ -0,0 +1,43 @@
# HATRA 21 position paper
A position paper on Luau for [Human Aspects of Types and Reasoning Assistants](https://2021.splashcon.org/home/hatra-2021) (HATRA) 2021.
## Installing latexmk
First install basictex
```
sudo brew install basictex
```
Then install the dependencies for the paper (sigh, by hand):
```
sudo tlmgr update --all
sudo tlmgr install acmart
sudo tlmgr install iftex
sudo tlmgr install xstring
sudo tlmgr install environ
sudo tlmgr install totpages
sudo tlmgr install trimspaces
sudo tlmgr install manyfoot
sudo tlmgr install ncctools
sudo tlmgr install comment
sudo tlmgr install balance
sudo tlmgr install preprint
sudo tlmgr install libertine
sudo tlmgr install inconsolata
sudo tlmgr install newtx
sudo tlmgr install latexmk
```
## Building the paper
To build the paper:
```
latexmk --pdf hatra21
```
To run latexmk in watching mode (where it rebuilds the PDF on each change):
```
latexmk --pdf --pvc hatra21
```

View File

@ -0,0 +1,115 @@
@Misc{Roblox,
author = {Roblox},
title = {What is {Roblox}},
year = 2021,
url = {https://corp.roblox.com},
}
@Misc{Luau,
author = {Roblox},
title = {The {Luau} Programming Language},
year = 2021,
url = {https://luau-lang.org},
}
@Misc{Lua,
author = {Lua.org and {PUC}-Rio},
title = {The {Lua} Programming Language},
year = 2021,
url = {https://lua.org},
}
@Misc{AllEducators,
author = {Roblox},
title = {Roblox Education: All Educators},
year = {2021},
url = {https://education.roblox.com/en-us/educators},
}
@Misc{RobloxDevelopers,
author = {Roblox},
title = {Roblox Developers Expected to Earn Over \$250 Million in 2020; Platform Now Has Over 150 Million Monthly Active Users
},
year = {2020},
url = {https://corp.roblox.com/2020/07/roblox-developers-expected-earn-250-million-2020-platform-now-150-million-monthly-active-users/},
}
@Book{TAPL,
author = {Benjamin C. Pierce},
title = {Types and Programming Languages},
publisher = {{MIT} Press},
year = {2002},
isbn = {0-262-16209-1},
}
@Book{TDDIdris,
author = {Edwin Brady},
title = {Type-Driven Development with {Idris}},
publisher = {Manning},
year = {2017},
isbn = {9781617293023},
}
@PhdThesis{TopQuality,
author = {Bastiaan J. Heeren},
title = {Top Quality Type Error Messages},
school = {U. Utrecht},
year = {2005},
}
@PhdThesis{RepairingTypeErrors,
author = {Bruce J. McAdam},
title = {Repairing Type Errors in Functional Programs},
school = {U. Edinburgh},
year = {2002},
}
@InProceedings{GradualTyping,
author = {Jeremy G. Siek and Walid Taha},
title = {Gradual Typing for Functional Languages},
booktitle = {Proc. Scheme and Functional Programming Workshop},
year = {2006},
pages = {81-92},
}
@InProceedings{WellTyped,
author = {Philip Wadler and Robert B. Findler},
title = {Well-typed Programs Cant be Blamed},
booktitle = {Proc. European Symp. Programming},
year = {2009},
pages = {1-16},
}
@InProceedings{Contracts,
author = {Robert B. Findler and Matthias Felleisen},
title = {Contracts for Higher-order Functions},
booktitle = {Proc. Int. Conf. Functional Programming},
year = {2002},
pages = {48-59},
}
@inproceedings{SuccessTyping,
author = {Lindahl, Tobias and Sagonas, Konstantinos},
title = {Practical Type Inference Based on Success Typings},
year = {2006},
booktitle = {Proc. Int. Conf. Principles and Practice of Declarative Programming},
pages = {167178},
}
@InProceedings{IncorrectnessLogic,
author = {O'Hearn, Peter W.},
title = {Incorrectness Logic},
year = {2020},
booktitle = {Proc. Symp. Principles of Programming Languages},
articleno = {10},
pages = {1-32},
}
@Misc{HowToDrawAnOwl,
author = {Know Your Meme},
title = {How To Draw An Owl},
year = {2010},
url = {https://knowyourmeme.com/memes/how-to-draw-an-owl},
}

BIN
papers/hatra21/hatra21.pdf Normal file

Binary file not shown.

340
papers/hatra21/hatra21.tex Normal file
View File

@ -0,0 +1,340 @@
\documentclass[acmsmall]{acmart}
\setcopyright{rightsretained}
\copyrightyear{2021}
\acmYear{2021}
\acmConference[HATRA '21]{Human Aspects of Types and Reasoning Assistants}{October 2021}{Chicago, IL}
\acmBooktitle{HATRA '21: Human Aspects of Types and Reasoning Assistants}
\newcommand{\squnder}[1]{\color{red}\underline{{\color{black}#1}}\color{black}}
\newcommand{\infer}[2]{\frac{\textstyle#1}{\textstyle#2}}
\newcommand{\erase}{\mathrm{erase}}
\newcommand{\evCtx}{\mathcal{E}}
\newcommand{\NIL}{\mathsf{nil}}
\newcommand{\TRUE}{\mathsf{true}}
\newcommand{\FALSE}{\mathsf{false}}
\newcommand{\NUMBER}{\mathsf{number}}
\newcommand{\ERROR}{\mathsf{error}}
\newcommand{\IF}{\mathsf{if}\,}
\newcommand{\THEN}{\,\mathsf{then}\,}
\newcommand{\ELSE}{\,\mathsf{else}\,}
\newcommand{\END}{\,\mathsf{end}}
\newcommand{\FUNCTION}{\mathsf{function}\,}
\newcommand{\RETURN}{\mathsf{return}\,}
\begin{document}
\title{Position Paper: Some Goals of the Luau Type System}
\author{Lily Brown}
\author{Andy Friesen}
\author{Alan Jeffrey}
\affiliation{
\institution{Roblox}
\city{San Mateo}
\state{CA}
\country{USA}
}
\begin{abstract}
Luau is the scripting language used in creating Roblox experiences.
It is a statically-typed language based on the dynamically-typed Lua language,
with type inference. These types are used in the
IDE, for example when providing autocomplete suggestions. In this
paper, we describe some of the goals of the Luau type system,
focusing on where the goals are different from those of other type systems.
\end{abstract}
\maketitle
\section{Introduction}
The Roblox~\cite{Roblox} platform allows anyone to create shared,
immersive, 3D experiences. At the time of writing, there are
approximately 20~million experiences available on Roblox, created
by 8~million developers. Roblox creators are often young, for
example there are over 200~Roblox kids' coding camps in 65~countries
listed at~\cite{AllEducators}.
The Luau programming language~\cite{Luau} is the scripting language
used by developers of Roblox experiences. Luau is derived from the Lua
programming language~\cite{Lua}, with additional capabilities,
including a type inference engine.
This paper will discuss some of the goals of the Luau type system,
focusing on where the goals are different from those of other type systems.
\section{Human Aspects}
\subsection{Heterogeneous developer community}
Quoting a Roblox 2020 report \cite{RobloxDevelopers}:
\begin{itemize}
\item Adopt Me! now has over 10 billion plays and surpassed 1.6 million concurrent users in game earlier this year.
\item Piggy, launched in January 2020, has close to 5 billion visits in just over six months.
\item There are now 345,000 developers on the platform who are monetizing their games.
\end{itemize}
This demonstrates how heterogeneous the Roblox developer community is:
developers of experiences with billions of plays are on the same
platform as children first learning to code. Moreover, \emph{both of
these groups are important}, as the professional development studios
bring high-quality experiences to the platform, and the beginning creators
contribute to the energetic creative community, and will form the next generation of developers.
\subsection{Goal-driven learning}
All developers are goal-driven, but this is especially true for
learners. A learner will download Roblox Studio (the IDE) with an
experience in mind, often designing an obstacle course (an ``obby'')
to play in with their friends.
The user experience of developing a Roblox experience is primarily a
3D interactive one, seen in Fig.~\ref{fig:studio}(a). The user designs
and deploys 3D assets such as terrain, parts and joints, and provides
them with physics attributes such as mass and orientation. The user
can interact with the experience in Studio, and deploy it to a Roblox
server so anyone with the Roblox app can play it. Physics, rendering
and multiplayer are all immediately accessible to all creators.
\begin{figure}
\includegraphics[width=0.48\textwidth]{studio-mow.png}
\includegraphics[width=0.48\textwidth]{studio-script-editor.png}
\caption{Roblox Studio's 3D environment editor (a), and script editor (b)}
\label{fig:studio}
\end{figure}
At some point during experience design, the user of Studio has a need
which can't be met by the physics engine alone. ``The stairs should
light up when a player walks on them'' or ``a firework is set off
every few seconds.'' At this point they will discover the script
editor, seen in Fig.~\ref{fig:studio}(b).
This onboarding experience is different from many initial exposures to
programming, in that by the time the user first opens the script
editor, they have already built much of their creation, and have a
very specific concrete aim. It suggests a Luau goal for helping the
majority of creators: \emph{support learning how to perform specific
tasks} (for example through autocomplete suggestions).
\subsection{Type-driven development}
Professional development studios are also goal-directed (though the
goals may be more abstract, such as ``decrease user churn'' or
``improve frame rate'') but have additional needs:
\begin{itemize}
\item \emph{Code planning}:
code spends much of its development time in an incomplete state,
with holes that will be filled in later.
\item \emph{Code refactoring}:
experiences evolve over time, and it easy for changes to
break previously-held invariants.
\item \emph{Defect detection}:
code has errors, and detecting these at runtime (for example by crash telemetry)
can be expensive and recovery can be time-consuming.
\end{itemize}
Detecting defects ahead-of-time is a traditional goal of type systems,
resulting in an array of techniques for establishing safety results,
surveyed for example in~\cite{TAPL}. Supporting code planning and
refactoring are some of the goals of \emph{type-driven
development}~\cite{TDDIdris} under the slogan ``type, define,
refine''. For example. a common use of type-driven development is to
rename a property, which is achieved by changing the name in one place,
and then fixing the resulting type errors---once the type system stops
reporting errors, the refactoring is complete.
To help support the transition from novice to experienced developer,
types are introduced gradually, through API documentation and type discovery.
Type inference provides many of the benefits of type-driven development
even to creators who are not explicitly providing types.
\section{Types}
\subsection{Infallible types}
Goal: \emph{support type-driven tools for all programs}.
Programs spend much of their time under development in an incomplete state, even if the final artifact
is well-typed. Tools should support this by providing type information even for ill-typed programs.
An analogy is infallible parsers, which perform error recovery and provide an AST for all input texts.
Program analysis can still flag type errors, for example with red
squiggly underlining. Formalizing this, rather than a judgment
$\Gamma\vdash M:T$, for an input term $M$, there is a judgment
$\Gamma \vdash M \Rightarrow N : T$ where $N$ is an output term
where some subterms are \emph{flagged} as having type errors, written $\squnder{N}$. Write $\erase(N)$
for the result of erasing flaggings: $\erase(\squnder{N}) = \erase(N)$.
%% For example the usual
%% type rules for field access becomes:
%% \[
%% \infer{
%% \Gamma \vdash M \Rightarrow M' : T
%% }{
%% \Gamma \vdash M.\ell \Rightarrow M'.\ell : U
%% }
%% [
%% T = \{ \overline{\ell:U} \} \mbox{ and } (\ell:U) \in (\overline{\ell:U})
%% ]
%% \]
%% but there is also a rule for unsuccessful field access:
%% \[
%% \infer{
%% \Gamma \vdash M \Rightarrow M' : T
%% }{
%% \Gamma \vdash M.\ell \Rightarrow \squnder{M'.\ell} : U
%% }
%% [
%% T = \{ \overline{\ell:U} \} \mbox{ implies } \ell \not\in \overline{\ell}
%% ]
%% \]
%% In this type rule, $U$ is unconstrained.
The goal of infallible types is that every term can be typed:
\begin{itemize}
\item \emph{Typability}: for every $M$ and $\Gamma$,
there are $N$ and $T$ such that $\Gamma \vdash M \Rightarrow N : T$.
\item \emph{Erasure}: if $\Gamma \vdash M \Rightarrow N : T$
then $\erase(M) = \erase(N)$
\end{itemize}
Some issues raised by infallible types:
\begin{itemize}
\item Which heuristics should be used to provide types for flagged programs? For example, could one
use minimal edit distance to correct for spelling mistakes in field names?
\item How can we avoid cascading type errors, where a developer is
faced with type errors that are artifacts of the heuristics rather
than genuine errors?
\item How can the goals of an infallible type system be formalized?
\end{itemize}
\emph{Related work}:
there is a large body of work on type error reporting
(see, for example, the survey in~\cite[Ch.~3]{TopQuality})
and on type-directed program repair
(see, for example, the survey in~\cite[Ch.~3]{RepairingTypeErrors}),
but not type repair, or on
the semantics of programs with type errors. Many compilers perform
error recovery during typechecking, but do not provide a semantics
for programs with type errors.
\subsection{Strict types}
Goal: \emph{no false negatives.}
For developers who are interested in defect detection, Luau provides a \emph{strict mode},
which acts much like a traditional, sound, type system. This has the goal of ``no false negatives'' that is any
run-time error is flagged. This is formalized using:
\begin{itemize}
\item \emph{Operational semantics}: a reduction judgment $M \rightarrow N$ on terms.
\item \emph{Values}: a subset of terms representing a successfully completed evaluation.
\end{itemize}
Error states at runtime are represented as stuck states (terms that are not
values but cannot reduce), and showing that no well-typed program is
stuck. This is not true if typing is infallible, but can fairly
straightforwardly be adapted. We extend the operational semantics to flagged terms,
where $M \rightarrow M'$ implies $\squnder{M} \rightarrow \squnder{M'}$, and
for any value $V$ we have $\squnder{V} \rightarrow V$, then show:
\begin{itemize}
\item \emph{Progress}: if ${} \vdash M \Rightarrow N : T$, then either $N \rightarrow N'$ or $N$ is a value or $N$ has a flagged subterm.
\item \emph{Preservation}: if ${} \vdash M \Rightarrow N : T$ and $N \rightarrow N'$ then $M \rightarrow^*M'$ and ${} \vdash M' \Rightarrow N' : T$.
\end{itemize}
Some issues raised by infallible types:
\begin{itemize}
\item How should the judgments and their metatheory be set up?
\item How should type inference and generic functions be handled?
\item Is the operational semantics of flagged values
($\squnder{V} \rightarrow V$) the right one?
\item Will higher-order code require wrappers on functions?
\end{itemize}
\emph{Related work}: gradual typing and blame analysis, e.g.~\cite{GradualTyping,WellTyped,Contracts}
\subsection{Nonstrict types}
Goal: \emph{no false positives.}
For developers who are not interested in defect detection, type-driven
tools and techniques such as autocomplete, API documentation
and support for refactoring can still be useful.
For such developers, Luau provides a
\emph{nonstrict mode}, which we hope will eventually be useful for all
developers. This does \emph{not} aim for soundness, but instead has
the goal of ``no false positives``, in the sense that any flagged code
is guaranteed to produce a runtime error when executed.
On the face of it, this is undecidable, since a program such as
$(\IF f() \THEN \ERROR \END)$ will produce a runtime error when $f()$ is
$\TRUE$, but we can aim for a weaker property, that all flagged code
is either dead code or will produce an error. Either of these is a
defect, so deserves flagging, even if the tool does not know
which reason applies.
We can formalize this by defining an \emph{evaluation context}
$\evCtx[\bullet]$, and saying $M$ is \emph{incorrectly flagged}
if it is of the form $\evCtx[\squnder{V}]$. We can then define:
\begin{itemize}
\item \emph{Correct flagging}: if ${} \vdash M \Rightarrow N : T$
then $N$ is correctly flagged.
\end{itemize}
Some issues raised by nonstrict types:
\begin{itemize}
\item Under this definition, any function that will terminate is unflagged, so
flagging will often move from function definitions to call sites.
\item This definition will not allow an unchecked use of an optional value
to be flagged, for example if $f() : \NUMBER?$ (meaning $f$ may optionally return a number)
then a strict type system can flag $1 + f()$ but a nonstrict one cannot.
\item Property update of tables in languages like Luau always succeeds
(the property is inserted if it did not exist), and so functions which
update properties cannot be flagged.
\item Does nonstrict typing require whole program analysis,
to find all the possible types a property might be updated with?
\item The natural formulation of function types in a nonstrict setting
is that of~\cite{???}: if $f: T \rightarrow U$ and $f(V) \rightarrow^* W$
then $V:T$ and $W:U$. This formulation is \emph{covariant} in $T$,
not \emph{contavariant}; what impact does this have?
\end{itemize}
\emph{Related work}: success types~\cite{SuccessTyping} and incorrectness logic~\cite{IncorrectnessLogic}.
\subsection{Mixing types}
Goal: \emph{support mixed strict/nonstrict development}.
Like every active software community, Roblox developers share code
with one another constantly. First- and third-party developers alike
frequently share entire software packages written in Luau. To add to
this, many Roblox games are authored not by just one developer, but a
team.
It is therefore crucial that we offer first-class support for mixing
code written in strict and nonstrict modes.
Some issues raised by mixed-mode types:
\begin{itemize}
\item How much feedback can we offer for a nonstrict script that is
importing strict-mode code?
\item In strict mode, how do we talk about values and types that are
drawn from nonstrict code?
\item How can we combine the goals of strict and nonstrict types?
\item Can we have strict and non-strict mode infer the same types,
only with different flagging?
\end{itemize}
\emph{Related work}: this appears to be an under-explored area.
\section{Conclusions}
In this paper, we have presented some of the goals of the Luau type
system, and how they map to the needs of the Roblox creator
community. We have sketched what a solution might look like, all that
remains is to draw the owl~\cite{HowToDrawAnOwl}.
\bibliographystyle{ACM-Reference-Format} \bibliography{bibliography}
\end{document}

Binary file not shown.

After

Width:  |  Height:  |  Size: 589 KiB

Binary file not shown.

After

Width:  |  Height:  |  Size: 294 KiB