1 \documentclass[runningheads]{llncs}
7 % \myincludegraphics{filename}{place}{width}{caption}{label}
8 \newcommand{\myincludegraphics}[5]{
11 \includegraphics[width=#3]{eps/#1.eps}
18 %\usepackage[show]{ed}
19 %\usepackage{draftstamp}
21 \newcommand{\musing}{\texttt{musing}}
22 \newcommand{\musings}{\texttt{musings}}
23 \newcommand{\ws}{Web-Service}
24 \newcommand{\wss}{Web-Services}
25 \newcommand{\hbugs}{H-Bugs}
26 \newcommand{\helm}{HELM}
27 \newcommand{\Omegapp}{$\Omega$mega}
28 \newcommand{\OmegaAnts}{$\Omega$mega-Ants}
30 \title{Brokers and Web-Services for Automatic Deduction: a Case Study}
33 Claudio Sacerdoti Coen\thanks{Partially supported by `MoWGLI: Math on the Web, Get it by Logic and Interfaces', EU IST-2001-33562} \and
34 Stefano Zacchiroli\thanks{Partially supported by `MyThS: Models and Types for Security in Mobile Distributed Systems', EU FET-GC IST-2001-32617}}
37 Department of Computer Science\\
38 University of Bologna\\
39 Mura Anteo Zamboni 7, 40127 Bologna, ITALY\\
40 \email{sacerdot@cs.unibo.it}
42 Department of Computer Science\\
43 \'Ecole Normale Sup\'erieure\\
44 45, Rue d'Ulm, F-75230 Paris Cedex 05, FRANCE\\
45 \email{zack@cs.unibo.it}
55 We present a planning broker and several Web-Services for automatic deduction.
56 Each Web-Service implements one of the tactics usually available in
57 interactive proof-assistants. When the broker is submitted a ``proof status''
58 (an incomplete proof tree and a focus on an open goal) it dispatches the proof
59 to the Web-Services, collects the successful results, and send them back to
60 the client as ``hints'' as soon as they are available.
62 In our experience this architecture turns out to be helpful both for
63 experienced users (who can take benefit of distributing heavy computations)
64 and beginners (who can learn from it).
67 \section{Introduction}
68 The \ws{} approach at software development seems to be a working solution for
69 getting rid of a wide range of incompatibilities between communicating
70 software applications. W3C's efforts in standardizing related technologies
71 grant longevity and implementations availability for frameworks based on
72 \wss{} for information exchange. As a direct consequence, the number of such
73 frameworks is increasing and the World Wide Web is moving from a disorganized
74 repository of human-understandable HTML documents to a disorganized repository
75 of applications working on machine-understandable XML documents both for input
78 The big challenge for the next future is to provide stable and reliable
79 services over this disorganized, unreliable, and ever-evolving architecture.
80 The standard solution is to provide a further level of stable services (called
81 \emph{brokers}) that behave as common gateways/addresses for client
82 applications to access a wide variety of services and abstract over them.
84 Since the \emph{Declaration of Linz}, the MONET
85 Consortium\footnote{\url{http://monet.nag.co.uk/cocoon/monet/index.html}}
86 is working on the development of a framework, based on the
87 \wss{}/brokers approach, aimed at providing a set of software tools for the
88 advertisement and the discovery of mathematical \wss{}.
89 %CSC This framework turns out to be strongly based on both \wss{} and brokers.
91 Several groups have already developed software bus and
92 services\footnote{The most part of these systems predate the development of
93 \wss. Those systems whose development is still active are slowly being
94 reimplemented as \wss.} providing both computational and reasoning
95 capabilities \cite{ws1,ws2,ws3,ws4}: the first ones are implemented on top of
96 Computer Algebra Systems; the second ones provide interfaces to well-known
98 Proof-planners, proof-assistants, CASs and
99 domain-specific problem solvers are natural candidates to be clients of these
100 services. Nevertheless, so far the number of examples in the literature has
101 been insufficient to fully assess the concrete benefits of the framework.
103 In this paper we present an architecture, namely \hbugs{}, implementing a
104 \emph{suggestion engine} for the proof assistant developed on behalf of the
105 \helm{}\footnote{Hypertextual Electronic Library of Mathematics,
106 \url{http://helm.cs.unibo.it}} project
107 \cite{helm}. We provide several \wss{} (called \emph{tutors}) able to
108 suggest possible ways to proceed in a proof. The tutors are orchestrated
109 by a broker (a \ws{} itself) that is able to dispatch a proof
110 status from a client (the proof-assistant) to the tutors;
111 each tutor tries to make progress in the proof and, in case
112 of success, notifies the client that shows an \emph{hint} to the user.
113 The broker is an instance of the homonymous entity of the MONET framework.
114 The tutors are MONET services. Another \ws{} (which is not described in this
115 paper and which is called Getter \cite{zack}) is used to locate and download
116 mathematical entities; the Getter plays the role of the Mathematical Object
117 Manager of the MONET framework.
119 A precursor of \hbugs{} is the \OmegaAnts{} project
120 \cite{omegaants1,omegaants2}, which provided similar functionalities to the
121 \Omegapp{} proof-planner \cite{omega}. The main architectural difference
122 between \hbugs{} and \OmegaAnts{} is that the latter is based on a
123 black-board architecture and it is not implemented using \wss{} and
126 In Sect. \ref{architecture} we present the architecture of \hbugs{}.
127 A usage session is shown in Sect. \ref{usage}.
128 Further implementation details are given in Sect. \ref{implementation}.
129 Sect. \ref{tutors} is an overview of the tutors that have been implemented.
130 As usual, the final section of this paper is devoted to conclusions and future works.
132 \section{An \hbugs{} Bird's Eye View}
134 \myincludegraphics{arch}{t}{8cm}{\hbugs{} architecture}{\hbugs{} architecture}
136 The \hbugs{} architecture (depicted in Fig. \ref{arch}) is based on three
137 different kinds of actors: \emph{clients}, \emph{brokers}, and \emph{tutors}.
138 Each actor presents one or more \ws{} interfaces to its neighbors \hbugs{}
141 In this section we detail the role and requirements of each kind of
142 actors and we discuss about the correspondences between them and the MONET
143 entities described in \cite{MONET-Overview}.
144 Due to lack of space, we cannot compare our framework to similar proposals, as
145 the older and more advanced \Omegapp{} system. The study of the
146 correspondences with MONET is well motivated by the fact that the MONET
147 framework is still under development and that our implementation is one of the
148 first experiments in \ws based distributed reasoning. On the other hand, a
149 comparison with \Omegapp{} would be less interesting since the functionalities we
150 provide so far are just a subset of the \OmegaAnts{} ones.
153 An \hbugs{} client is a software component able to produce \emph{proof
154 status} and to consume \emph{hints}.
156 A proof status is a representation of an incomplete proof and is supposed to
157 be informative enough to be used by an interactive proof assistant. No
158 additional requirements exist on the proof status, but there should be an
159 agreement on its format between clients and tutors. A hint is an
160 encoding of a step that can be performed in order to proceed in an
161 incomplete proof. Usually it represents a reference to a tactic available
162 on some proof assistant along with an instantiation for its formal
163 parameters. Hints can also be more structured: a hint can be
164 as complex as a whole proof-plan.
166 Using W3C's terminology \cite{ws-glossary}, clients act both as \ws{}
167 providers and requesters, see Fig. \ref{interfaces}.
168 They act as providers receiving hints from the broker; they act as
169 requesters submitting new status to the tutors.
170 Clients additionally use broker services to know which tutors are available
171 and to subscribe to one or more of them.
173 Usually, when the client role is taken by an interactive proof assistant,
174 new status are sent to the broker as soon as the proof change (e.g. when the
175 user applies a tactic or when a new proof is started); hints are shown to
176 the user by the means of some effects in the user interface (e.g. popping a
177 dialog box or enlightening a tactic button).
179 \hbugs{} clients act as MONET clients and ask brokers to provide access to a
180 set of services (the tutors). \hbugs{} has no actors corresponding to
181 MONET's Broker Locating Service (since the client is supposed to know the
182 URI of at least one broker). The \hbugs{} clients and tutors contact the
183 Getter (a MONET Mathematical Object Manager) to locate and retrieve
184 mathematical items from the \helm{} library.
185 The proof status that are exchanged
186 by the \hbugs{} actors, instead, are built on the fly and are neither
187 stored nor given an unique identifier (URI) to be managed by the
191 \myincludegraphics{interfaces}{t!}{10cm}{\hbugs{} \wss{} interfaces}
192 {\hbugs{} \wss{} interfaces}
194 Brokers are the key actors of the \hbugs{} architecture since they
195 act as intermediaries between clients and tutors. They behave as \wss{}
196 providers and requesters for \emph{both} clients and tutors, see Fig.
199 With respect to the client, a broker acts as a \ws{} provider, receiving the
200 proof status and forwarding it to one or more tutors.
201 It also acts as a \ws{} requester sending
202 hints to the client as soon as they are available from the tutors.
204 With respect to the tutors, the \ws{} provider role is accomplished by
205 receiving hints as soon as they are produced; as a requester, it is
206 accomplished by asking for computations (\emph{musings} in \hbugs{}
207 terminology) on status received by clients and by stopping already late but
208 still ongoing \musings{}.
210 Additionally brokers keep track of available tutors and clients
213 \hbugs{} brokers act as MONET brokers implementing the following components:
214 Client Manager, Service Registry Manager (keeping track of available
215 tutors), Planning Manager (choosing the available tutors among the ones to
216 which the client is subscribed), Execution Manager. The Service Manager
217 component is not required since the session handler, that identifies
218 a session between a service and a broker, is provided to the service by
219 the broker instead of being received from the service when the session is
220 initialized. In particular, a session is identified by an unique identifier
221 for the client (its URL) and an unique identifier for the broker (its
224 Notice that \hbugs{} brokers have no knowledge of the domain area of
225 proof-assistants, nor they are able to interpret the messages that they
226 are forwarding. They are indeed only in charge of maintaining the
227 abstraction of several reasoning blackboards --- one for each client ---
228 of capacity one: a blackboard is created when the client submits a problem;
229 it is then ``shared'' by the client and all the tutors until the client
230 submits the next problem. For instance, replacing the client with a CAS and
231 all the tutors with agents implementing different resolution methods for
232 differential equations would not require any change in the broker. Notice
233 that all the tutors must expose the same interface to the broker.
235 The MONET architecture specification does not state explicitly whether the
236 service and broker answers can be asynchronous. Nevertheless, the
237 described information flow implicitly suggests a synchronous implementation.
238 On the contrary, in \hbugs{} every request is asynchronous: the connection
239 used by an actor to issue a query is immediately closed; when a service
240 produces an answer, it gives it back to the issuer by calling the
241 appropriate actor's method.
244 Tutors are software components able to consume proof status producing hints.
245 \hbugs{} does not specify by which means hints should be produced: tutors
246 can use any means necessary (heuristics, external theorem prover or CAS,
247 etc.). The only requirement is that there exists an agreement on the formats
248 of proof status and hints.
250 Tutors act both as \ws{} providers and requesters for the broker, see Fig.
252 providers, they wait for commands requesting to start a new \musing{} on
253 a given proof status or to stop an old, out of date, \musing{}. As
254 requesters, they signal to the broker the end of a \musing{} along with its
255 outcome (a hint in case of success or a failure notification).
257 \hbugs{} tutors act as MONET services.
259 \section{An \hbugs{} Session Example}
261 In this section we describe a typical \hbugs{} session. The aim of the
262 session is to solve the following easy exercise:
264 Let $x$ be a generic real number. Using the \helm{} proof-engine,
267 x = \frac{(x+1)*(x+1) - 1 - x*x}{2}
271 Let us suppose that the \hbugs{} broker is already running and that the
272 tutors already registered themselves to the broker.
273 When the user starts our proof-engine \texttt{gTopLevel}, the system registers itself to
274 the broker, that sends back the list of available tutors. By default,
275 \texttt{gTopLevel} notifies to the broker its intention of subscribing to every
276 tutor available. The user can always open a configuration window where she
277 is presented the list of available tutors and she can independently subscribe
278 and unsubscribe herself to each tutor.
280 \myincludegraphics{step1}{t}{12cm}{Example session.}
282 %\myincludegraphics{step2}{t}{4cm}{Example session, snapshot 2.}
283 % {Example session, snapshot 2.}
285 The user can now insert into the system the statement of the theorem and start
286 proving it. Let us suppose that the first step of the user is proving
287 that the denominator 2 is different from 0. Once that this technical result
288 is proven, the user must prove the goal shown in the upper right corner
289 of the window in background in Fig. \ref{step1}.
291 While the user is wondering how to proceed in the proof, the tutors are
292 trying to progress in the proof. After a while, the tutors' suggestions
293 start to appear in the lower part of the \hbugs{} interface window
294 (the topmost window in Fig. \ref{step1}). In this case, the tutors are able
295 to produce 23 hints. The first and not very useful hint suggests to proceed in
296 the proof by exchanging the two sides of the equality.
297 The second hint suggests to reduce both sides of the equality to their normal
298 form by using only reductions which are justified by the ring structure of the
299 real numbers; the two normal forms, though, are so different that the proof is
300 not really simplified.
301 All the residual 21 hints suggest to apply one lemma from the distributed
302 library of \helm{}. The user can look at the statement of any lemma by clicking
305 The user can now look at the list of suggestions and realize that a good one is
306 applying the lemma \texttt{r\_Rmult\_mult} that allows to multiply both equality
307 members by the same scalar\footnote{Even if she does not receive the hint, the
308 user probably already knows that this is the right way to proceed. The
309 difficult part, accomplished by the hint, is guessing what is the name of the
311 Double-clicking on the hint automatically applies
312 the lemma, reducing the proof to closing three new goals. The first one asks
313 the user the scalar to use as an argument of the previous lemma; the second
314 one states that the scalar is different from 0; the third lemma (the main
315 one) asks to prove the equality between the two new members.
316 % is shown in Fig. \ref{step2} where $?_3[H;x]$ stands for
317 % the still unknown scalar argument, which can have only $H$ and $x$ as
320 The user proceeds by instantiating the scalar with the number 2. The
321 \texttt{Assumption} tutor now suggests to close the second goal (that
322 states that $2 \neq 0$) by applying the hypothesis $H$.
323 No useful suggestions, instead, are generated for the main goal
324 $2*x = 2*((x+1)*(x+1)-1-x*x)*2^{-1}$.
325 To proceed in the proof the user needs to simplify the
326 expression using the lemma $Rinv\_r\_simpl\_m$ that states that
327 $\forall x,y.\;y = x * y * x^{-1}$. Since we do not provide yet any tutor
328 suggesting simplifications, the user must find out this simplification by
329 himself. Once she founds it, the goal is reduced to proving that
330 $2*x = (x+1)*(x+1) - 1 - x*x$. This equality is easily solved by the
331 \texttt{Ring} tutor, that suggests\footnote{The \texttt{Ring} suggestion is
332 just one of the 22 hints that the user receives. It is the only hint that
333 does not open new goals, but the user right now does not have any way to know
334 that.} to the user how to complete the proof in one macrostep.
336 %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
337 % Comandi da dare a gTopLevel %
338 %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
340 % !x.(not (eqT ? (Rplus R1 R1) R0)) -> (eqT ? x (Rdiv (Rminus (Rminus (Rmult (Rplus x R1) (Rplus x R1)) R1) (Rmult x x)) (Rplus R1 R1)))
344 % Simpl (per fare unfold di Rdiv)
346 % (Rmult_assoc (Rplus R1 R1) (Rplus (Rplus (Rmult (Rplus x R1) (Rplus x R1)) (Ropp R1)) (Ropp (Rmult x x))) (Rinv (Rplus R1 R1)))
348 % (Rinv_r_simpl_m (Rplus R1 R1) (Rplus (Rplus (Rmult (Rplus x R1) (Rplus x R1)) (Ropp R1)) (Ropp (Rmult x x))) H)
350 %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%
352 \section{Implementation's Highlights}
353 \label{implementation}
354 In this section we present some of the most relevant implementation details of
355 the \hbugs{} architecture.
358 \paragraph{Proof status}
359 In our implementation of the \hbugs{} architecture we used the proof
360 assistant of the \helm{} project (codename \texttt{gTopLevel}) as an \hbugs{}
361 client. Thus we have implemented serialization/deserialization capabilities
362 for its internal status. In order to be able to describe \wss{} that
363 exchange status in WSDL using the XML Schema type system, we have chosen an
364 XML format as the target format for the serialization.
366 % A schematic representation of the gTopLevel internal status is depicted in
368 Each proof is represented by a tuple of four elements:
369 \emph{uri}, \emph{metasenv}, \emph{proof}, \emph{thesis}.
371 % \myincludegraphics{status}{t}{8cm}{gTopLevel proof status}{gTopLevel proof
375 \item[uri]: an URI chosen by the user at the beginning of the proof
376 process. Once (and if) proved, that URI will globally identify the term
377 inside the \helm{} library (given that the user decides to save it).
378 \item[thesis]: the ongoing proof thesis
379 \item[proof]: the current incomplete proof tree. It can contain
380 \emph{metavariables} (holes) that stands for the parts of the proof
381 that are still to be completed. Each metavariable appearing in the
382 tree references one element of the metavariables environment
384 \item[metasenv]: the metavariables environment is a list of
385 \emph{goals} (unproved conjectures).
386 In order to complete the proof, the user has to instantiate every
387 metavariable in the proof with a proof of the corresponding goal.
388 Each goal is identified by an unique identifier and has a context
389 and a type (the goal thesis). The context is a list of named
390 hypotheses (declarations and definitions). Thus the context and the goal
391 thesis form a sequent, which is the statement of the proof that will
392 be used to instantiate the metavariable occurrences.
395 Each of these information is represented in XML as described in
396 \cite{mowglicic}. Additionally, an \hbugs{} status carries the unique
397 identifier of the current goal, which is the goal the user is currently
398 focused on. Using this value it is possible to implement different client
399 side strategies: the user could ask the tutors to work on the goal
400 she is considering or to work on the other ``background'' goals.
403 A hint in the \hbugs{} architecture should carry enough information to
404 permit the client to progress in the current proof. In our
405 implementation each hint corresponds to either one of the tactics available
406 to the user in gTopLevel (together with its actual arguments) or a set
407 of alternative suggestions (a list of hints).
409 For tactics that do not require any particular argument (like tactics that
410 apply type constructors or decision procedures)
411 only the tactic name is represented in the hint. For tactics that need
412 terms as arguments (for example the \texttt{Apply} tactic that apply a
413 given lemma) the hint includes a textual representation of them, using the
414 same representation used by the interactive proof assistant when querying
415 user for terms. In order to be transmitted between \wss{}, hints are
418 It is also possible for a tutor to return more hints at once,
419 grouping them in a particular XML element.
420 This feature turns out to be particularly useful for the
421 \emph{searchPatternApply} tutor (see Sect. \ref{tutors}) that
422 queries a lemma database and returns to the client a list of all lemmas that
423 could be used to complete the proof. This particular hint is encoded as a
424 list of \texttt{Apply} hints, each of them having one of the results as term
427 We would like to stress that the \hbugs{} architecture has no dependency
428 on either the hint or the status representation: the only message parts
429 that are fixed are those representing the administrative messages
430 (the envelopes in the \wss{} terminology). In particular, the broker can
431 manage at the same time several sessions working on different status/hints
432 formats. Of course, there must be an agreement between the clients
433 and the tutors on the format of the data exchanged.
435 In our implementation the client does not trust the tutors hints:
436 being encoded as references to available tactics imply
437 that an \hbugs{} client, at the receipt of a hint, simply try to replay
439 done by a tutor on the local copy of the proof. The application of the hint
440 can even fail to type check and the client copy of the proof can be left
441 undamaged after spotting the error. Note, however, that it is still
442 possible to implement a complex tutor that looks for a proof doing
443 backtracking and that
444 send back to the client a hint whose argument is a witness (a trace) of
445 the proof found: the client applies the hint reconstructing (and checking
446 the correctness of) the proof from the witness, without having to
447 re-discover the proof itself.
449 An alternative implementation where the tutors are trusted would simply
450 send back to the client a new proof-status. Upon receiving the
451 proof-status, the client would just override its current proof status with
452 the suggested one. In the case of those clients which are implemented
453 using proof-objects (as the Coq proof-assistant, for instance), it is
454 still possible for the client to type-check the proof-object and reject
455 wrong hints. The systems that are not based on proof-objects
456 (as PVS, NuPRL, etc.), instead, must completely trust the new proof-status.
457 In this case the \hbugs{} architecture would need at least to be extended
458 with clients-tutors authentication.
460 \paragraph{Registries}
461 Being central in the \hbugs{} architecture, the broker is also responsible
462 of housekeeping operations both for clients and tutors. These operations are
463 implemented using three different data structures called \emph{registries}:
464 clients registry, tutors registry and \musings{} registry.
466 In order to use the suggestion engine a client should register itself to the
467 broker and subscribe to one or more tutors. The registration phase is
468 triggered by the client using the \texttt{Register\_client} method of the
469 broker to send him an unique identifier and its base URI as a \ws{}. After
470 the registration, the client can use the \texttt{List\_tutors} method of the
471 broker to get a list of available tutors. Eventually the client can
472 subscribe to one or more of these using the \texttt{Subscribe} method of the
473 broker. Clients can also unregister from brokers using
474 \texttt{Unregister\_client} method.
476 The broker keeps track of both registered clients and clients' subscriptions
477 in the clients registry.
479 In order to be advertised to clients during the subscription phase, tutors
480 should register to the broker using the \texttt{Register\_tutor} method of
481 the broker. This method is really similar to \texttt{Register\_client}:
482 tutors are required to send an unique identifier and a base URI for their
484 Additionally tutors are required to send an human readable description of
485 their capabilities; this information could be used by the client user to
486 decide which tutors she wants to subscribe to. As the clients, tutors can
487 unregister from brokers using \texttt{Unregister\_broker} method.
489 Each time the client status changes, it get sent sent to the
490 broker using its \texttt{Status} method. Using both the clients registry (to
491 lookup the client's subscription) and the tutors registry (to check if some tutors
492 have unsubscribed), the broker is able to decide to which tutors the
493 new status have to be forwarded.
494 % \ednote{CSC: qui o nei lavori futuri parlare
495 % della possibilit\'a di avere un vero brocker che multiplexi le richieste
496 % dei client localizzando i servizi, etc.}
498 The forwarding operation is performed using the \texttt{Start\_musing}
499 method of the tutors, that is a request to start a new computation
500 (\emph{\musing{}}) on a given status. The return value of
501 \texttt{Start\_musing} is a
502 \musing{} identifier that is saved in the \musings{} registry along with
503 the identifier of the client that triggered the \musing{}.
505 As soon as a tutor completes an \musing{}, it informs the broker
506 using its \texttt{Musing\_completed} method; the broker can now remove the
507 \musing{} entry from the \musings{} registry and, depending on its outcome,
508 inform the client. In case of success one of the \texttt{Musing\_completed}
509 arguments is a hint to be sent to the client; otherwise there is no need to
510 inform him and the \texttt{Musing\_completed} method is called
511 just to update the \musings{} registry.
513 Consulting the \musings{} registry, the broker is able to know, at each
514 time, which \musings{} are in execution on which tutor. This peculiarity is
515 exploited by the broker on invocation of the \texttt{Status} method.
516 Receiving a new status from the client implies indeed that the previous
517 status no longer exists and all \musings{} working on it should be stopped:
518 additionally to the already described behavior (i.e. starting new
519 \musings{} on the received status), the broker takes also care of stopping
520 ongoing computation invoking the \texttt{Stop\_musing} method of the tutors.
524 % As already discussed, all \hbugs{} actors act as \wss{} offering one or more
525 % services to neighbor actors. To grant as most accessibility as possible to
526 % our \wss{} we have chosen to bind them using the HTTP/POST\footnote{Given
527 % that our proof assistant was entirely developed in the Objective Caml
528 % language, we have chosen to develop also \hbugs{} in that language in order
529 % to maximize code reuse. To develop \wss{} in Objective Caml we have
530 % developed an auxiliary generic library (\emph{O'HTTP}) that can be used to
531 % write HTTP 1.1 Web servers and abstracts over GET/POST parsing. This library
532 % supports different kinds of Web servers architectures, including
533 % multi-process and multi-threaded ones.} bindings described in
534 % \cite{wsdlbindings}.
537 Each tutor exposes a \ws{} interface and should be able to work, not only for
538 many different clients referring to a common broker, but also for many
539 different brokers. The potential high number of concurrent clients imposes
540 a multi-threaded or multi-process architecture.
542 Our current implementation is based on a multi threaded architecture
543 exploiting the capabilities of the O'HTTP library \cite{zack}. Each tutor is
544 composed by one always running thread plus an additional thread for each
546 One thread is devoted to listening for incoming \ws{} requests; when a
547 request is received the control is passed to a second thread, created on the
548 fly, that handle the incoming request (usual one-thread-per-request approach
549 in web servers design).
550 In particular if the received request is \texttt{Start\_musing}, a new thread is
551 spawned to handle it; the thread in duty to handle the HTTP request
552 returns an HTTP response containing the identifier of the just started
553 \texttt{musing}, and then dies. If the received request is
554 \texttt{Stop\_musing}, instead, the spawned thread kills the thread
555 responsible for the \texttt{musing} whose identifier is the argument
556 of the \texttt{Stop\_musing} method.
558 This architecture turns out to be scalable and allows the running threads
559 to share the cache of loaded (and type-checked) theorems.
560 As we will explain in Sect. \ref{tutors}, this feature turns out to be
561 really useful for tactics that rely on a huge but fixed set of lemmas,
562 as every reflexive tactic.
564 The implementation of a tutor within the described architecture is not that
565 difficult having a language with good threading capabilities (as OCaml has)
566 and a pool of already implemented tactics (as \texttt{gTopLevel} has).
567 Working with threads is known to be really error prone due to
568 concurrent programming intrinsic complexity. Moreover, there is a
569 non-neglectable part of code that needs to be duplicated in every tutor:
570 the code to register the tutor to the broker and to handle HTTP requests;
571 the code to manage the creation and termination of threads; and the code for
572 parsing the requests and serializing the answers. As a consequence we
573 have written a generic implementation of a tutor which is parameterized
574 over the code that actually proposes the hint and over some administrative
575 data (as the port the tutor will be listening to).
577 The generic tutor skeleton is really helpful in writing new tutors.
578 Nevertheless, the code obtained by converting existing tactics into tutors
579 is still quite repetitive: every tutor that wraps a tactic has to
580 instantiate its own copy of the proof-engine kernel and, for each request,
581 it has to override its status, guess the tactic arguments, apply the tactic
582 and, in case of success, send back a hint with the tactic name and the
583 chosen arguments. Of course, the complex part of the work is guessing the
584 right arguments. For the simple case of tactics that do not require
585 any argument, though, we are able to automatically generate the whole
586 tutor code given the tactic name. Concretely, we have written a
587 tactic-based tutor template and a script that parses an XML file with
588 the specification of the tutor and generates the tutor's code.
589 The XML file describes the tutor's port, the code to invoke the tactic,
590 the hint that is sent back upon successful application and a
591 human readable explanation of the tactic implemented by the tutor.
593 \section{The Implemented \hbugs Tutors}
595 To test the \hbugs{} architecture and to assess the utility of a suggestion
596 engine for the end user, we have implemented several tutors. In particular,
597 we have investigated three classes of tutors:
599 \item \emph{Tutors for beginners}. These are tutors that implement tactics
600 which are neither computationally expensive nor difficult to understand:
601 an expert user can always understand if the tactic can be applied or not
602 without having to try it. For example, the following implemented tutors
603 belong to this class:
605 \item \emph{Assumption Tutor}: it ends the proof if the thesis is
606 equivalent\footnote{In our implementation, the equivalence relation
607 imposed by the logical framework is \emph{convertibility}. Two
608 expressions are convertible when they reduce to the same normal form.
609 Two ``equal'' terms depending on free variables can be non-convertible
610 since free variables stop the reduction. For example, $2x$ is convertible
611 with $(3-1)x$ because they both reduce to the same normal form
612 $x + x + 0$; but $2x$ is not convertible to $x2$ since the latter is
613 already in normal form.}
614 to one of the hypotheses\footnote{
615 In some cases, especially when non-trivial computations are involved,
616 the user is totally unable to figure out the convertibility of two terms.
617 In these cases the tutor becomes handy also for expert users.}.
618 \item \emph{Contradiction Tutor}: it ends the proof by \emph{reductio ad
619 adsurdum} if one hypothesis is equivalent to $False$.
620 \item \emph{Symmetry Tutor}: if the goal thesis is an equality, it
621 suggests to apply the commutative property.
622 \item \emph{Left/Right/Exists/Split/Reflexivity/Constructor Tutors}:
623 the Constructor Tutor suggests to proceed in the proof by applying one
624 or more constructors when the goal thesis is an inductive type or a
625 proposition inductively defined according to the declarative
626 style\footnote{An example of a proposition that can be given in
627 declarative style is the $\le$ relation over natural numbers:
628 $\le$ is the smallest relation
629 such that $n \le n$ for every $n$ and $n \le m$ for every $n,m$ such
630 that $n \le p$ where $p$ is the predecessor of $m$. Thus, a proof
631 of $n \le n$ is simply the application of the first constructor to
632 $n$ and a proof of $n \le m$ is the application of the second
633 constructor to $n,m$ and a proof of $n \le m$.}.
634 Since disjunction, conjunction, existential quantification and
635 Leibniz equality are particular cases of inductive propositions,
636 all the other tutors of this class are instantiations of the
637 the Constructor tactic. Left and Right suggest to prove a disjunction
638 by proving its left/right member; Split reduces the proof of a
639 conjunction to the two proof of its members; Exists suggests to
640 prove an existential quantification by providing a
641 witness\footnote{This task is left to the user.}; Reflexivity proves
642 an equality whenever the two sides are convertible.
644 Beginners, when first faced with a tactic-based proof-assistant, get
645 lost quite soon since the set of tactics is large and their names and
646 semantics must be remembered by heart. Tutorials are provided to guide
647 the user step-by-step in a few proofs, suggesting the tactics that must
648 be used. We believe that our beginners tutors can provide an auxiliary
649 learning tool: after the tutorial, the user is not suddenly left alone
650 with the system, but she can experiment with variations of the exercises given
651 in the tutorial as much as she like, still getting useful suggestions.
652 Thus the user is allowed to focus on learning how to do a formal proof
653 instead of wasting efforts trying to remember the interface to the system.
654 \item \emph{Tutors for Computationally Expensive Tactics}. Several tactics have
655 an unpredictable behavior, in the sense that it is unfeasible to understand
656 whether they will succeed or they will fail when applied and what will be
657 their result. Among them, there are several tactics either computationally
658 expensive or resource consuming. In the first case, the user is not
659 willing to try a tactic and wait for a long time just to understand its
660 outcome: she would prefer to keep on concentrating on the proof and
661 have the tactic applied in background and receive out-of-band notification
662 of its success. The second case is similar, but the tactic application must
663 be performed on a remote machine to avoid overloading the user host
664 with several concurrent resource consuming applications.
666 Finally, several complex tactics and in particular all the tactics based
667 on reflexive techniques depend on a pretty large set of definitions, lemmas
668 and theorems. When these tactics are applied, the system needs to retrieve
669 and load all the lemmas. Pre-loading all the material needed by every
670 tactic can quickly lead to long initialization times and to large memory
671 footstamps. A specialized tutor running on a remote machine, instead,
672 can easily pre-load the required theorems.
674 As an example of computationally expensive task, we have implemented
675 a tutor for the \emph{Ring} tactic \cite{ringboutin}.
676 The tutor is able to prove an equality over a ring by reducing both members
677 to a common normal form. The reduction, which may require some time in
679 is based on the usual commutative, associative and neutral element properties
680 of a ring. The tactic is implemented using a reflexive technique, which
681 means that the reduction trace is not stored in the proof-object itself:
682 the type-checker is able to perform the reduction on-the-fly thanks to
683 the conversion rules of the system. As a consequence, in the library there
684 must be stored both the algorithm used for the reduction and the proof of
685 correctness of the algorithm, based on the ring axioms. This big proof
686 and all of its lemmas must be retrieved and loaded in order to apply the
687 tactic. The Ring tutor loads and caches all the required theorems the
688 first time it is contacted.
689 \item \emph{Intelligent Tutors}. Expert users can already benefit from the previous
690 class of tutors. Nevertheless, to achieve a significative production gain,
691 they need more intelligent tutors implementing domain-specific theorem
692 provers or able to perform complex computations. These tutors are not just
693 plain implementations of tactics or decision procedures, but can be
694 more complex software agents interacting with third-parties software,
695 such as proof-planners, CAS or theorem-provers.
697 To test the productivity impact of intelligent tutors, we have implemented
698 a tutor that is interfaced with the \helm{}
699 Search-Engine\footnote{\url{http://helm.cs.unibo.it/library.html}} and that
700 is able to look for every theorem in the distributed library that can
701 be applied to proceed in the proof. Even if the tutor deductive power
702 is extremely limited\footnote{We do not attempt to check if the new goals
703 obtained applying a lemma can be automatically proved or, even better,
704 automatically disproved to reject the lemma.}, it is not unusual for
705 the tutor to come up with precious hints that can save several minutes of
706 work that would be spent in proving again already proven results or
707 figuring out where the lemmas could have been stored in the library.
710 \section{Conclusions and Future Work}
712 In this paper we described a suggestion engine architecture for
713 proof-assistants: the client (a proof-assistant) sends the current proof
714 status to several distributed \wss{} (called tutors) that try to progress
715 in the proof and, in case of success, send back an appropriate hint
716 (a proof-plan) to the user. The user, that in the meantime was able to
717 reason and progress in the proof, is notified with the hints and can decide
718 to apply or ignore them. A broker is provided to decouple the clients and
719 the tutors and to allow the client to locate and invoke the available remote
720 services. The whole architecture is an instance of the MONET architecture
721 for Mathematical \wss{}. It constitutes a reimplementation of the core
722 features of the pioneering \OmegaAnts{} system in the new \wss{}
725 A running prototype has been implemented as part of the
726 \helm{} project \cite{helm}
727 and we already provide several tutors. Some of them are simple tutors that
728 try to apply one or more tactics of the \helm{} Proof-Engine, which is also
729 our client. We also have a much more complex tutor that is interfaced
730 with the \helm{} Search-Engine and looks for lemmas that can be directly
733 Future works comprise the implementation of new features and tutors, and
734 the embedding of the system in larger test cases. For instance, one
735 interesting case study would be interfacing a CAS as Maple to the
736 \hbugs{} broker, developing at the same time a tutor that implements the
737 Field tactic of Coq, which proves the equality of two expressions in an
738 abstract field by reducing both members to the same normal form. CASs can
739 produce several compact normal forms, which are particularly informative
740 to the user and that may suggest how to proceed in a proof. Unfortunately,
742 provide any certificate about the correctness of the simplification. On
743 the contrary, the Field tactic certifies the equality of two expressions,
744 but produces normal forms that are hardly a simplification of the original
745 formula. The benefits for the CAS would be obtained by using the Field tutor
746 to certify the CAS simplifications, proving that the Field normal form
747 of an expression is preserved by the simplification.
748 More advanced tutors could exploit the CAS to reduce the
749 goal to compact normal forms \cite{maplemodeforCoq}, making the Field tutor
750 certify the simplification according to the skeptical approach.
752 We have many plans for further developing both the \hbugs{} architecture and
753 our prototype. Interesting results could be obtained
754 augmenting the informative content of each suggestion. We can for example
755 modify the broker so that also negative results are sent back to the client.
756 Those negative suggestions could be reflected in the user interface by
757 deactivating commands to narrow the choice of tactics available to the user.
758 This approach could be interesting especially for novice users, but requires
759 the client to increase their level of trust in the other actors.
761 We plan also to add some rating mechanism to the architecture. A first
762 improvement in this direction could be distinguishing between hints that, when
763 applied, are able to completely close one or more goals, and
764 tactics that progress in the proof by reducing one or more goals to new goals:
765 since the new goals can be false, the user can be forced later on to
768 Other heuristics and or measures could be added to rate
769 hints and show them to the user in a particular order: an interesting one
770 could be a measure that try to minimize the size of the generated proof,
771 privileging therefore non-overkilling solutions \cite{ring}.
773 We are also considering to follow the \OmegaAnts{} path adding
774 ``recursion'' to the system so that the proof status resulting from the
775 application of old hints are cached somewhere and could be used as a starting
776 point for new hint searches. The approach is interesting, but it represents
777 a big shift towards automatic theorem proving: thus we must consider if it is
778 worth the effort given the increasing availability of automation in proof
779 assistants tactics and the ongoing development of \wss{} based on
780 already existent and well developed theorem provers.
782 Even if not strictly part of the \hbugs{} architecture, the graphical user
783 interface (GUI) of our prototype needs a lot of improvement if we want
784 it to be really usable by novices. In particular, a critical issue
785 is avoiding continuous distractions for the user determined by the hints
786 that are asynchronously pushed to her.
788 Our \wss{} still lack a real integration in the MONET architecture,
789 since we do not provide the different ontologies to describe our problems,
790 solutions, queries, and services. In the short term, completing this task
791 could provide a significative feedback to the MONET consortium and would
792 enlarge the current set of available MONET actors on the Web. In the long
793 term, new more intelligent tutors could be developed on top of already
794 existent MONET \wss{}.
796 To conclude, \hbugs{} is a nice experiment meant to understand whether the
797 current \wss{} technology is mature enough to have a concrete and useful
798 impact on the daily work of proof-assistants users. So far, only the tutor
799 that is interfaced with the \helm{} Search-Engine has effectively increased
800 the productivity of experts users. The usefulness of the tutors developed for
801 beginners, instead, need further assessment.
803 \begin{thebibliography}{01}
805 \bibitem{ws-glossary} Web Services Glossary, W3C Working Draft, 14 May 2003.\\
806 \url{http://www.w3.org/TR/2003/WD-ws-gloss-20030514/}
808 \bibitem{wsdlbindings} Web Services Description Language (WSDL)
809 Version 1.2: Bindings, W3C Working Draft, 24 January 2003.\\
810 \url{http://www.w3.org/TR/wsdl12-bindings/}
812 \bibitem{ws1}A. Armando, D. Zini. Interfacing Computer Algebra and
813 Deduction Systems via the Logic Broker Architecture. In Proceedings
814 of the Eighth Calculemus symphosium, St. Andrews, Scotland, 6--7 August 2000.
816 \bibitem{ws2} O. Caprotti. Symbolic Evaluator Service. Project Report of
817 the MathBrocker Project, RISC-Linz, Johannes Kepler University, Linz,
820 \bibitem{helm} A. Asperti, F. Guidi, L. Padovani, C. Sacerdoti Coen, I. Schena.
821 Mathematical Knowledge Management in HELM. In Annals of Mathematics and
822 Artificial Intelligence, 38(1): 27--46, May 2003.
824 \bibitem{omegaants1} C. Benzm\"uller, V. Sorge. O-Ants -- An Open Approach
825 at Combining Interactive and Automated Theorem Proving. In M. Kerber and
826 M. Kohlhase (eds.), Integration of Symbolic and Mechanized Reasoning, pp.
829 \bibitem{omegaants2} C. Benzm\"uller, M. Jamnik, M. Kerber, V. Sorge.
830 Agent-based Mathematical Reasoning. In A. Armando and T. Jebelean (eds.),
831 Electronic Notes in Theoretical Computer Science, (1999) 23(3), Elsevier.
833 \bibitem{omega} C. Benzm\"uller, L. Cheikhrouhou, D. Fehrer, A. Fiedler,
834 X. Huang, M. Kerber, M. Kohlhase, K. Konrad, E. Melis, A. Meier,
835 W. Schaarschmidt, J. Siekmann, V. Sorge. OMEGA: Towards a Mathematical
836 Assistant. In W. McCune (ed), Proceedings of the 14th Conference on
837 Automated Deduction (CADE-14), Springer LNAI vol. 1249, pp. 252--255,
838 Townsville, Australia, 1997.
840 \bibitem{ringboutin} S. Boutin. Using reflection to build efficient and
841 certified decision procedures. In Martin Abadi and Takahashi Ito, editors,
842 TACS'97, volume 1281. LNCS, Springer-Verlag, 1997.
844 \bibitem{maplemodeforCoq} David Delahaye, Micaela Mayero.
845 A Maple Mode for Coq. Contribution to the Coq library.\\
846 \url{htpp://coq.inria.fr/contribs/MapleMode.html}
848 \bibitem{MONET-Overview} The MONET Consortium, MONET Architecture Overview,
849 Public Deliverable D04 of the MONET Project.\\
850 \url{http://monet.nag.co.uk/cocoon/monet/publicsdocs/monet-overview.pdf}
852 \bibitem{mowglicic} C. Sacerdoti Coen. Exportation Module, MoWGLI Deliverable
854 \url{http://mowgli.cs.unibo.it/html\_no\_frames/deliverables/transformation/d2a.html}
856 \bibitem{ring} C. Sacerdoti Coen. Tactics in Modern Proof-Assistants: the
857 Bad Habit of Overkilling. In Supplementary Proceedings of the 14th
858 International Conference TPHOLS 2001, pp. 352--367, Edinburgh.
860 \bibitem{zack} S. Zacchiroli. \emph{Web services per il supporto alla
861 dimostrazione interattiva}, Master Thesis, University of Bologna, 2002.
863 \bibitem{ws3} J. Zimmer and M. Kohlhase. System Description: The MathWeb
864 Software Bus for Distributed Mathematical Reasoning.
865 In Proceedings of the 18th International Conference on Automated Deduction
866 CADE 18, LNAI 2392, Springer Verlag, 2002.
868 \bibitem{ws4} R. Zippel. The MathBus. In Workshop on Internet Accessible
869 Mathematical Computation at ISSAC'99, Vancouver, Canada, July 28--31, 1999.
871 \end{thebibliography}