Descendant models = models fine tuned from an ancestor on one particular domain, e.g. by partitioning your training data by subject or source
Democracy = some weighted mix of the descendant models is used to find the next token