28 lines
957 B
Plaintext
28 lines
957 B
Plaintext
"In practice, we found that a high-entropy initial state is more likely to increase the speed of training.
|
|
The entropy is calculated by:
|
|
$$H=-\sum_{k= 1}^{n_k} p(k) \cdot \log p(k), p(k)=\frac{|A_k|}{|\mathcal{A}|}$$
|
|
where $H$ is the entropy, $|A_k|$ is the number of agent nodes in $k$-th cluster, $|\mathcal{A}|$ is the total number of agents.
|
|
To ensure the Cooperation Graph initialization has higher entropy,
|
|
we will randomly generate multiple initial states,
|
|
rank by their entropy and then pick the one with maximum $H$."
|
|
|
|
```
|
|
FROM ubuntu:latest
|
|
|
|
RUN apt-get update && \
|
|
apt-get install -y python3 python3-pip && \
|
|
rm -rf /var/lib/apt/lists/*
|
|
|
|
RUN echo '[global]' > /etc/pip.conf && \
|
|
echo 'index-url = https://mirrors.aliyun.com/pypi/simple/' >> /etc/pip.conf && \
|
|
echo 'trusted-host = mirrors.aliyun.com' >> /etc/pip.conf
|
|
|
|
RUN pip3 install gradio requests[socks] mdtex2html
|
|
|
|
COPY . /gpt
|
|
WORKDIR /gpt
|
|
|
|
|
|
CMD ["python3", "main.py"]
|
|
```
|