Keyboard Shortcuts?

×
  • Next step
  • Previous step
  • Skip this slide
  • Previous slide
  • mShow slide thumbnails
  • nShow notes
  • hShow handout latex source
  • NShow talk notes latex source

Click here and press the right key for the next slide (or swipe left)

also ...

Press the left key to go backwards (or swipe right)

Press n to toggle whether notes are shown (or add '?notes' to the url before the #)

Press m or double tap to slide thumbnails (menu)

Press ? at any time to show the keyboard shortcuts

\title {Joint Action \\ Lecture 99}
 
\maketitle
 

Lecture 99

Joint Action

\def \ititle {Lecture 99}
\def \isubtitle {Joint Action}
\begin{center}
{\Large
\textbf{\ititle}: \isubtitle
}
 
\iemail %
\end{center}
 

Sharing a Smile

 
\section{Sharing a Smile}
 
\section{Sharing a Smile}
[THIRD POINT: smiling is a goal-directed action, the goal of which is to smile that smile]
My topic is sharing a smile. But first think about ordinary, individual actions like genuine smiles.
What distinguishes a genuine smile from a muscle spasm or the exhalation of wind?
I want to suggest that it's this: the smile is a goal-directed action where the goal is to simile that smile.
But why think of the smile as goal-directed? Because smiling the smile requires considerable motor coordination: it’s not a matter of simple muscle contractions but more like the production of a phonetic gesture where context affects what is needed to realise the smile.
Further, like grasping an object or articulating a particular phoneme, it is an action that can be realised by different bodily movements in different contexts.
This is why I put slides of two quite different but both genuine smiles.
[Objection:]
Now you might say that the smile can't be goal-directed because is isn't explicable by appeal to belief, desire and intention
This is because the genuine smile is spontaneous and not something that can be produced at will (although it could probably be inhibited, at least to some extent); after all, this is what distinguishes the genuine from the polite smile.
\footnote{
From web source: The Duchenne smile involves both voluntary and involuntary contraction from two muscles: the zygomatic major (raising the corners of the mouth) and the orbicularis oculi (raising the cheeks and producing crow's feet around the eyes). The zygomatic major can be voluntarily contracted but many people can't voluntarily contract the orbicularis oculi muscle.
}
So now we might be tempted by the view that a smile is merely caused by an emotion in the way that gasses can cause you to burp.
[Reply:]
Maybe there are smiles like this, but some genuine smiles are sustained.
And what sustains them is a process of controll
How could this be if such smiles are not consequences of beliefs, desires and intentions?
I think a reasonably natural view here is to think that part of what makes an event a smile, a goal-directed action and not just a muscle spasm caused by excess wind, is the way that motor control is involved. Specifically, the genuine smile will involve a motor representation of the outcome, the smile, and this motor representation will lead to movements by way of planning-like motor processes.
But you don't have to buy this to agree with me.
All you have to accept is that actions like some smilings can be goal-directed and controlled even in the absence of relevant beliefs, desires and intentions.
I think smiles fall into the category of actions like graspings, reachings and gesturings which are goal-directed but do not necessarily involve intention.
 
So far, then, I've suggested that smiling is a goal-directed action, the goal of which is to smile that smile.
Now imagine a situation where a single individual encounters and event (a clown’s falling) which causes amusement which causes her to smile
Note that the smile also modulates the emotion; if, for example, she supressed the smile, the quality of her amusement would change.
How could we gain insight into the fine-grained dynamics of others’ emotions?
How could we ever appreciate the unfolding of another’s grief, or the way their engagement leads to an explosion of ecstasy at the climax of a concert?
Part of the answer is obvious: by being there, with them.
[Not that this is the only possibility --- in some cases we might be told.]
But how exactly does being there, in the same situation help?
Merely being in the same situation is surely not enough.
It’s not enough that we each experience amusement, grief or ecstasy.
After all, individuals are different. Different individuals’ feelings don’t unfold in the same way just because they are in the same situation.
It’s just here that collective intentionality is relevant.
\textbf{What is involved in sharing a smile?}
Minimally, I think there have to be two kinds of connection between us for us to share a smile.
First, the way your smile unfolds is shaped by how mine unfolds and conversely.
I also suppose that our smiles can be minutely coordinated with each other.
But it’s not just that our smiles are interdependent in this way ...
It’s also that each of our smiles is shaping the way our amusement unfolds.
So the way your amusement unfolds is being controlled by, and controlling, the way mine unfolds.
In sharing a smile, we are emotionally locked together.
[*todo: remove motor stuff for this talk! Also: don't lose sight of idea that control is a way of knowing.]
[*todo: need slide with control arrows highlighted (my emotion controls yours).]
[*Structure: (i) I know because my emotion controls yours; (ii) But if my emotion controls yours, how can yours be amusement at the clown's falling? because control is partial, and reciprocal; (iii) But the mere fact of control isn't enough for knowledge; rather, control must show up in experience somehow. After all, for all I have said so far, we might, in sharing a smile, be unaware that our emotions are locked together. (iv) There must be an experience that is distinctive of sharing a smile. (iv) Note that I don’t want to say that someone who is sharing a smile needs to understand the situation in the way I’m describing it. All I'm claiming is that the fact of reciprocal control somehow affects our awareness. (v) It may affect in our awareness insofar as we are sensitive to contingencies between our own actions' and others' actions, and between our actions and the causes of them. (vi) So my position is this: the reciprocal control justifies each agent in making judgements about how the others' amusement is unfolding, and this justification is at least indirectly available to the agents by virtue of their having experiences characteristic of sharing a smile. ]
Our being emotionally locked together means that to a significant extent I am feeling what you are feeling, that the way my amusement is unfolding matches they way your amusement is unfolding. So if you know how your own amusement is unfolding and you know that we are emotionally locked together, you can know much about how my amusement is unfolding. So joint expressions of emotion like sharing a smile have the potential to enable us to know not just that others are amused but how their amusement is unfolding.
But the fact of reciprocal control (which means our emotions are locked together) together doesn’t all by itself mean that we can know how each other’s emotions are unfolding. After all, for all I have said so far, we might, in sharing a smile, be unaware that our emotions are locked together. Now you might think this sounds implausible because its hard to imagine sharing a smile without an experience that is distinctive of sharing a smile. And it might be natural to describe this experience as an experience of sharing. But even if that is correct, it’s necessary to say exactly why someone who is sharing a smile is in a position to know things about how the other’s emotion is unfolding.
I don’t want to say that interaction only helps if you know that your emotions are locked together. That is, I don’t want to say that someone who is sharing a smile needs to understand the situation in the way I’m describing it. But minimally the fact of reciprocal control must somehow feature in our awareness.
[*The idea in outline: \begin{enumerate} \item the ways our amusements unfold is locked together \item this is in part because a single motor plan has two functions, production of your smile and prediction of my smile \item the single motor process means that we might experience being locked together in some way (not that our emotions are locked together but that our actions are, in something like (but not exactly) we experience actions when seeing ourselves in a mirror or on CCTV (check Johannes’ discussion of this)). \end{enumerate} ]
Here I want to offer a wild conjecture. In joint expressions of emotion there is a single motor plan with two functions, production and prediction. The motor plan both produces your own smile and enables you to predict the way the other’s smile will unfold. [*missing step about monitoring and experience. (The Haggard idea: motor planning can give rise to experiences concerning one's own actions \citep{Haggard:2005sc}.)] Because your plan has this dual function, your experience of the other’s (my?) smile is special. From your point of view, it is almost as if the other is smiling your smile.% \footnote{ Joel caricatured this idea seeing me eating fruit: ‘it’s almost as if I’m eating that fruit.’ } This means that sharing a smile has characteristic phenomenology.
This odd phenomenological effect means that in sharing a smile we can each think of the situation almost as if there were a single smile. And almost as if there were a single state of amusement. (In thinking of the situation like this it is important that we have a subject-neutral conception of the amusement and an agent-neutral conception of the smile.% \footnote{ Tom Smith asked about this. I clarified that I wasn’t suggesting there was a state of amusement which is ours, nor that the subjects are thinking of the situation in this way. That’s the point of the appeal to subject-neutral amusement. It’s a partial model of the situation. } [*Here I think I’m shifting back from the perspective of the participants in sharing a smile to the perspective of the theorist. Probably what I should say is, first, that a theorist can think of the situation in this way and use this to argue, second, that there is a simple, partial conception of the situation that doesn’t require understanding reciprocal control and interlocking emotions but is sufficient for each smiler to have knowledge of the way the other’s emotion unfolds.] So my suggestion is that in sharing a smile you experience my smile almost as if it were yours (or: you experience me almost as if I were smiling your smile), and so you might also experience our situation almost as if it involved a single state of amusement.
It's more like we each plan a single smile.
But---to reply to the objection---these plans have a dual function. Your plan both produces your own smile and enables you to simulate---to experience---my smile. And likewise for my plan. The interdependence of our smilings means that we could each think of the situation as if it were one in which a single state of amusement were responsible for our actions.
 

Development of Joint Action: Planning

 
\section{Development of Joint Action: Planning}
 
\section{Development of Joint Action: Planning}
When are humans first able to do what Bratman calls ‘interconnected planning’?

Objection: ‘Despite the common impression that joint action needs to be dumbed down for infants due to their ‘‘lack of a robust theory of mind’’ ... all the important social-cognitive building blocks for joint action appear to be in place: 1-year-old infants understand quite a bit about others’ goals and intentions and what knowledge they share with others’

\citep[p.~383]{carpenter:2009_howjoint}.
Carpenter conflates goals and intentions, so ignores the key difference between actions and plans.

‘I ... adopt Bratman’s (1992) influential formulation of joint action or shared cooperative activity. Bratman argued that in order for an activity to be considered shared or joint each partner needs to intend to perform the joint action together ‘‘in accordance with and because of meshing subplans’’ (p. 338) and this needs to be common knowledge between the participants’

\citep[p.~381]{carpenter:2009_howjoint}.

Carpenter, 2009

So: the objection I just offered to taking Bratman’s account of shared intention and joint action to characterise the notion of joint action of interest in explaining development was narrowly theoretical.
The objection was that you can’t explain the developmental emergence of mindreading by invoking joint action if your account of joint action implies that abilities to perform joint actions presuppose sophisticated mindreading.
Accepting this theoretical objection would be consistent with accepting Carpenter’s claim. The only consequence is that we would have to reject the conjecture that you can explain the developmental emergence of mindreading by invoking joint action.
challenge
Explain the emergence of sophisticated human activities including referential communication and mindreading.
conjecture
Joint action plays a role in explaining how sophisticated human activities emerge.
objection
Joint action presupposes mindreading at the limits of human abilities.
So I think Carpenter is saying: the objection is correct, and we should reject the conjecture.

‘shared intentional agency [i.e. ‘joint action’] consists, at bottom, in interconnected planning’

Bratman, 2011 p. 11

‘shared intentional agency [i.e. ‘joint action’] consists, at bottom, in interconnected planning agency of the participants’ \citep{Bratman:2011fk}.

Paulus et al, 2016 figure 1

Task: give the tool to another person, who needs to put the spherical end into the box. (Tip: you need to grasp it by the spherical end and pass it so that the other takes the cube-end; they can then insert it optimally.)

Paulus et al, 2016 figure 2B

‘3- and 5-year-old children do not consider another person’s actions in their own action planning (while showing action planning when acting alone on the apparatus).

Seven-year-old children and adults however, demonstrated evidence for joint action planning. ... While adult participants demonstrated the presence of joint action planning from the very first trials onward, this was not the case for the 7-year-old children who improved their performance across trials.’

\citep[p.~1059]{paulus:2016_development}

Paulus et al, 2016 p. 1059

Warneken et al, 2014 figure 1A

‘One child had to insert the turn-tool on the right of the apparatus and then turn so that the metal rod stretching across moved the panel out of the way of the ball. The other person could then insert the push tool on the left, pushing the silver ball into the hole similar to a billiard cue.’ \citep{warneken:2014_young}

Warneken et al, 2014 figure 2

Unidirectional : child A has to select the tool that B doesn’t have.
Bidirectional : child A can select either tool.
‘(a) Unidirectional: The left box will be opened first. Only the left child has a choice. For success, this child has to choose the push tool (lower left: thick handle, long thin top). The partner child has to retrieve the only available turn tool (upper right: thick handle, short thin top).’ \citep{warneken:2014_young}

Warneken et al, 2014 figure 3

BU - first bidirectional then unidirectional.
The three year olds are hopeless in all conditions except the bidirectional condition when they have first had the unimanual condition. So there is no forward planning, but there is some evidence that three-year-olds can take into account what another has done.
‘by age 3 children are able to learn, under certain circumstances, to take account of what a partner is doing in a collaborative problem-solving context. By age 5 they are already quite skillful at attending to and even anticipating a partner’s actions’ \citep[p.~57]{warneken:2014_young}.

What is shared intention?

Functional characterisation:

shared intention serves to (a) coordinate activities, (b) coordinate planning and (c) structure bargaining

Constraint:

Inferential integration... and normative integration (e.g. agglomeration)

Substantial account:

We have a shared intention that we J if

‘1. (a) I intend that we J and (b) you intend that we J

‘2. I intend that we J in accordance with and because of la, lb, and meshing subplans of la and lb; you intend [likewise] …

‘3. 1 and 2 are common knowledge between us’

(Bratman 1993: View 4)

Note that the conditions require not just that we intend the joint action, but that we intend it because of each other's intentions, where this is common knowledge.
What we have seen suggests that even three-year-olds are probably incapable of meeting this condition ...
... and the idea that they need to know things about another’s intentions about their intentions seems like a bold conjecture indeed.

Mismatch:

Bratman’s account of joint action

vs

1- to 3-year-olds’ joint action abilities

All the evidence has suggests that there is a mismatch between Bratman’s account of joint action and 1- to 3-year-olds’ joint action abilities.
Let’s consider two more studies on this ...

Meyer et al, 2016 figure 1A

Child pass cups to adult who has one had occupied. Adult and child separated by glass window, so have to reach around.
‘Example of a 3-year-old child engaged in the joint-action task with an adult experimenter. (Top) The start of a trial in which the child has a cup and sees the experimenter holding the cup tower in her left hand. The lower two images illustrate the different possible response choices of the child. The child chooses in one trial to pass the cup on the side that is accommodating the other’s actions (middle); and in another trial, on the side that is not accommodating the execution of the partner’s actions (bottom).’ \citep{meyer:2016_planning}
‘To build one tower, children had to pass five cups. Then the experimenter introduced the next hand puppet with the same procedure. With each change of hand puppet, she switched sides by putting the puppet on her previously free hand ... in total four towers were built, two on each side.’ \citep{meyer:2016_planning}

Meyer et al, 2016 figure 1B

Meyer et al, 2016 figure 1C

Meyer et al, 2016 figure 2

‘two measures of interest: the initial response choice (i.e. the first trial only) and the continuous response choice (i.e. all trials except the first)’ \citep{meyer:2016_planning}
‘In their initial response choice, children had to plan ahead without having experienced how their own action would affect their joint- action partner. Thus, the initial response choice reflects whether children proactively planned to accommodate their joint-action partner.’ \citep{meyer:2016_planning}

Meyer et al, 2016 figure 3

3-year-olds and younger: not good at changing the side they pass the object on to take into account where the other will reach it.
‘children proactively plan their actions in a way that accommodates the actions of their partner early in childhood. By contrast, the flexible adjustment of their action plans to their partner only begins to develop in the fourth and fifth year of life. Notably, even at the age of 5 children only adjusted their action plans to a surprisingly small degree’ \citep[p.~8]{meyer:2016_planning}.
I think: the good results on the initial response choice at all ages suggest that some ability to take into account another’s immediate future actions may already be present at 2-and-a-half years of age (although this is not strong evidence).
Note that the success concerns anticipating ACTIONS not antcipating PLANS.

Mismatch:

Bratman’s account of joint action

vs

1- to 3-year-olds’ joint action abilities

All the evidence has suggests that there is a mismatch between Bratman’s account of joint action and 1- to 3-year-olds’ joint action abilities.
Let’s consider one more study on this ...

Gerson et al, 2016 figure 1B

Subjects: three-year-olds.
Four balls, four egg cups. Yellow balls must go in yellow egg cups, brown balls likewise. One ball is brown and yellow and can go in either colour egg cup. But you only have four egg cups, so you have to put it in whatever egg cup will enable you to place the other three balls correctly.
Experiment: compare performance in an individual condition (child does all) with performance in a joint condition (child alternates with puppet) and a control condition with a machine (not joint action, but similar turn-taking required.)
‘The joint play session consisted of nine trials. In the first, fourth, and seventh trial, Kip let the child place the first (and third) ball and Kip placed the second (multi-colored) and fourth ball. Kip always placed the multi-colored ball in the cup that allowed all forthcoming balls to be placed correctly. In the other six trials, Kip placed the first and third balls and the child placed the second and fourth balls. This ensured that the number of trials for which the child had to plan (by placing the multi-colored ball correctly) was matched across the individual and joint conditions.’ \citep{gerson:2016_social}

Gerson et al, 2016 figure 3

Results: When acting alone or alternating with a machine, three-year-olds were above chance at selecting the correct egg cup for the two-coloured ball. But in joint action they were not above chance, and their performance was significantly worse than when acting alone.
The results indicate that
‘proactive planning for two individuals, even when they share a common goal, is more difficult than planning ahead solely for oneself’ \citep[p.~128]{gerson:2016_social}.

How does Gerson et al’s study bear on the claim that Bratman’s account of shared intention and joint action characterises the social interactions children perform in the first two years of life?

What is shared intention?

Functional characterisation:

shared intention serves to (a) coordinate activities, (b) coordinate planning and (c) structure bargaining

Constraint:

Inferential integration... and normative integration (e.g. agglomeration)

Substantial account:

We have a shared intention that we J if

‘1. (a) I intend that we J and (b) you intend that we J

‘2. I intend that we J in accordance with and because of la, lb, and meshing subplans of la and lb; you intend [likewise] …

‘3. 1 and 2 are common knowledge between us’

(Bratman 1993: View 4)

As I said, All the evidence has suggests that there is a mismatch between Bratman’s account of joint action and 1- to 3-year-olds’ joint action abilities.
I started with Carpenter’s objection ...

Objection: ‘Despite the common impression that joint action needs to be dumbed down for infants due to their ‘‘lack of a robust theory of mind’’ ... all the important social-cognitive building blocks for joint action appear to be in place: 1-year-old infants understand quite a bit about others’ goals and intentions and what knowledge they share with others’

Carpenter conflates goals and intentions, so ignores the key difference between actions and plans.

‘I ... adopt Bratman’s (1992) influential formulation of joint action or shared cooperative activity. Bratman argued that in order for an activity to be considered shared or joint each partner needs to intend to perform the joint action together ‘‘in accordance with and because of meshing subplans’’ (p. 338) and this needs to be common knowledge between the participants’

Carpenter, 2009

It turns out, I think, that Carpenter is wrong to this extent. Whatever exactly one-year-olds mindreading abilities, they do not seem to be making much use of information about others’ intentions in performing joint actions. And so it is wrong to think of their abilities in terms of Bratman’s account of joint actions.
challenge
Explain the emergence of sophisticated human activities including referential communication and mindreading.
conjecture
Joint action plays a role in explaining how sophisticated human activities emerge.
objection
Joint action presupposes mindreading at the limits of human abilities.
So with respect to our overall line of enquiry, it may still be possible to hold on to the conjecture if we can overcome the objection.
 

Development of Joint Action: Years 1-2

 
\section{Development of Joint Action: Years 1-2}
 
\section{Development of Joint Action: Years 1-2}
What (if any) joint actions are humans capable of just at the point they are beginning to communicate referentially (typically around the first birthday)?

No planning ...

... So which joint actions can one- and two-year-olds perform?

‘By 12–18 months, infants are beginning to participate in a variety of joint actions which show many of the characteristics of adult joint action.’

\citep[p.~388]{carpenter:2009_howjoint}

Carpenter, 2009 p. 388

\citet{carpenter:2009_howjoint} points out that infants in the second year of life can perform a rich array of joint actions. But before we come to her insights, let’s step back and consider the first year.

4-6 months

dyadic interactions

Methods: still face; replay (infants detect whether caregiver reacts, so are less satisifed with a replay).
\citet[p.~196]{brownell:2011_early} comment: ‘infants become progressively tuned to the timing and structure of dyadic exchange’

6-12 months

triadic interactions

\citet[p.~197]{brownell:2011_early} comment: ‘adult-infant dyadic interactions expand to include objects, events, and individuals outside of the dyad (Moore and Dunham 1995)’

~ 12-24 months

infants initiate and re-start joint actions

e.g. ‘peek-a-boo; tickle; rhythmic games; chase’

Brownell, 2011

\citet[p.~197]{brownell:2011_early} comment: ‘Eventually, infants begin themselves to initiate joint action with adults and to respond in unique ways when adults violate their expectations for participation in the joint activity. For example, if a parent becomes distracted during peek-a-boo and fails to take her turn, 12-month olds may try to re-start the game by vocalizing to the adult or by re- enacting a well-rehearsed part of the game such as placing the cloth over their own face and waiting. One-year olds also begin to point to interesting sights and events to share their interest and affect and they expect adults to respond appropriately by looking (Liszkowski, et al 2006).’
‘infants learn about cooperation by participating in joint action structured by skilled and knowledgeable interactive partners before they can represent, understand, or generate it themselves. Cooperative joint action develops in the context of dyadic interaction with adults in which the adult initially takes responsibility for and actively structures the joint activity and the infant progressively comes to master the structure, timing, and communications involved in the joint action with the support and guidance of the adult. ... Eager participants from the beginning, it takes approximately 2 years for infants to become autonomous contributors to sustained, goal-directed joint activity as active, collaborative partners’ \citep[p.~200]{brownell:2011_early}.
‘Without the structure and scaffolding provided by the expert adult partner, 1-year-old children are unable to generate and sustain joint action with each other in the service of an external goal. By age two, however, they can do so readily, even with unfamiliar agemates and on novel, unfamiliar tasks’ \citep[p.~204]{brownell:2011_early}.

drumming together

(from two years of age; Endedijk et al, 2015)

Even the youngest children, 2-year-olds timed the start of a bust of drumming activity to coincide with the start of the other’s drumming. But only older children actually synchronized individual taps.
‘Children did not receive any instructions pertaining to drumming together or coordinating their drumming with their dyad partner.’ \citep{endedijk:2015_development}.
‘While 4-year-olds coordinated the timing of drum hits, children between 2- and 4 years of age showed indications of interpersonal coordination as indicated by the beginnings and endings of drumming bouts. Children showed more overlap in their bouts than would be expected by chance’ \citep[p.~720]{endedijk:2015_development}.
But note that children under four years of age provided no evidence that they synchronized the timings of their beats with each other (as measured by cross-correlation of inter-tap intervals).

Warneken and Tomasello, 2006

Warneken and Tomasello, 2006

Age: 18 months.
This spontaneous helping might be interpreted as initiating joint action. Note that in control conditions,
The videos are from \citep{Warneken:2006bs} but the more detailed study is \citep{Warneken:2006qe}.

Warneken and Tomasello, 2006

From \citep{Warneken:2006bs}. In the control condition, the human gave no signal of needing help. (‘For each task, there was a corresponding control task in which the same basic situation was present but with no in- dication that this was a problem for the adult (14). This ensured that the infant_s motivation was not just to reinstate the original situation or to have the adult repeat the action, but rather to actually help the adult with his problem. ... In control trials, he looked at the object with a neutral facial expression for 20 s. In no case did the infant receive any benefit (reward or praise) for helping. Each individual was tested in all 10 tasks, a subsample of 5 tasks administered as experimental and 5 as control conditions (in systematically varied order).’)

Warneken and Tomasello, 2007 figure 2 (part)

Ages: 14, 18 and 24 months.
Elevator task: free an object from a cylinder. Two roles. Role A: position yourself in the right location to retrieve the target object. Role B: push up the cylinder and hold it in place while another retrieves it.

Warneken and Tomasello, 2007 figure 3 (part)

‘The 14-month-olds of this study displayed coordinated behaviors in the elevator task Role A of positioning themselves in the right location and retrieving the target object from the cylinder when the partner pushed it up, but they had major problems performing Role B, pushing the cylinder up and holding it in place until the partner could fetch the object. If they pushed up the cylinder at all, they would repeatedly drop it when the other person was just about to take the object out’ \citep{warneken:2007_helping}.
\citet[p.~200]{brownell:2011_early} comment: ‘Across these non-routine tasks, 18-month olds’ behavior with the adult partner was rated as predominantly “uncoordinated” (vs. “coordinated” or “very coordinated”) and the children exhibited “low” cooperative engagement (vs. “medium” or “high”). On those tasks requiring children to anticipate the partner’s actions and to adjust their behavior accordingly, 18-month olds’ performance did not differ from chance. By age two, children operated at “medium” levels of cooperative engagement and were above chance in anticipating and coordinating their behavior with the adult.’
‘social coordinations show a marked improvement between children at 14 and 18 months of age’ \citep{warneken:2007_helping}.

Warneken and Tomasello, 2007 figure 2 (part)

Trampoline task: first bounce a cube on the trampoline with an experimenter, then experimenter pauses and we measure child’s attempts to re-engage.

Warneken and Tomasello, 2007 figure 4

The re-engagement is a powerful signal that children

Infants’ ‘attempts to reactivate the partner in interruption periods indicate that they were aware of the interdependency of actions—that the execution of their own actions was conditional on that of the partner’

‘these instances might also exemplify a basic understanding of shared intentionality

Warneken and Tomasello, 2007 pp. 290-1

Infants’ ‘attempts to reactivate the partner in interruption periods indicate that they were aware of the interdependency of actions—that the execution of their own actions was conditional on that of the partner ... these instances might also exemplify a basic understanding of shared intentionality’ \citep[p.~290--1]{warneken:2007_helping}.
Contrast Warneken and Tomasello with Brownell, who offers a different suggestion.
‘advances in infants’ ability to coordinate their behavior with one another are associated with multiple measures of developing self-other representations. One- and two-year olds’ symbolic representation of self and other in pretend play (e.g., pretending that a doll is feeding itself) was related to the amount of coordinated behavior they produced with a peer on the structured cooperation tasks described above (Brownell and Carriger 1990)’ \citep[p.~206]{brownell:2011_early}.
‘children who better produced and comprehended language about their own and others’ feelings and actions, and who could refer to themselves and others using personal pronouns likewise monitored their peer’s behavior more often and produced more joint activity with the peer (Brownell et al 2006)’ \citep[p.~206]{brownell:2011_early}.

Joint Action in Years 1-2

In the first and second years of life,

there is joint action

but it does not appear to involve planning agency

or shared intention.

Bratman’s account does not characterise

the sort of joint actions

infants perform in the first and second years of life.

Two-year-olds perform some joint actions but not others.
What distinguishes the joint actions they can perform from those they cannot?