No reference needed further than the Wikipedia definition:
words and phrases, such as “me” or “here”, that cannot be fully understood without additional contextual information — in this case, the identity of the speaker (“me”) and the speaker’s location (“here”). Words are deictic if their semantic meaning is fixed but their denotational meaning varies depending on time and/or place. Words or phrases that require contextual information to convey any meaning – for example, English pronouns – are deictic.
then is anaphoric, but it is also deictic: the time that it refers to depends on the time that the speaker is speaking. You could argue that its reference is fixed if it is a historical narrative: “Caesar crossed the Rubicon. Then he began the Roman civil war”—refers to 49 BC whether spoken in 2000 or 2050. But in “I will eat, and then I will go to bed”, the time of the then is quite different depending on when the phrase is spoken, because the clause it references itself has its time anchored on the speaker.
If the deictic centre is not fixed but egocentric, then the expression is deictic, even if it’s primarily anaphoric (as then is). The denotational meaning varies depending on time and/or place.