
Weighted Belief updates #49

Draft
wants to merge 24 commits into base: main


Conversation

weihangzheng
Collaborator

Weighted Belief Changes

Modifications Implemented:

  • Each agent is now equipped with:

    • A list of truthful weights: a length-N vector of scalar values in [0, 1], where entry j is the probability that this agent considers agent j to be truthful.
  • Recent Communications:

    • A dictionary holding the most recent (2xN) messages exchanged between agents.
    • The structure is designed to accommodate data from previous iterations, so that a sequential model (RNN, LSTM, or Transformer) can consume the history.
  • Adaptive Model:

    • Processes a (2xN) collection of communicated messages (Recent Communications) from one agent to another and predicts a probability between 0 and 1, which is then used to revise the truthful weights list.
    • A single model could be shared by all agents, since each agent presents a similar noise level or belief vector to every other agent; the current approach keeps a separate model per agent to allow for future complexity enhancements.
    • The architecture can be expanded to process a (2xNxk) dataset, where k is the number of previous iterations to consider. Alternatively, a state-based memory model such as an RNN, LSTM, or Transformer could manage the temporal data more effectively.
  • Belief Update Mechanism:

    • The previous method of stripping extreme values (strip D) has been replaced.
    • Each incoming message about a specific agent's position is now weighted by the truthful weights list, followed by a normalization step (weighted sum, then normalize).
    • This makes the update linear in the number of agents (with the model's evaluation time as a constant factor), compared with the O(n log(n)) previously required to sort and strip D values.
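As a minimal sketch of the structures and update described above (the names `Agent`, `update_belief`, and the fallback behavior for an all-zero weight vector are illustrative assumptions, not this PR's actual API):

```python
from dataclasses import dataclass, field

@dataclass
class Agent:
    # Truthful weights: entry j is this agent's probability, in [0, 1],
    # that agent j is truthful.
    truthful_weights: list
    # Most recent (2 x N) messages exchanged with each other agent,
    # keyed by agent id; the history can later feed a sequential model.
    recent_comms: dict = field(default_factory=dict)

def update_belief(reports, weights):
    """Fuse incoming position reports about one agent.

    Weighted sum of the reports, then normalize by the total weight.
    This is O(n) in the number of agents, versus the O(n log n)
    sort-and-strip-D method it replaces.
    """
    total = sum(weights)
    if total == 0:
        # No reporter is trusted; fall back to a plain mean (an assumption).
        return sum(reports) / len(reports)
    return sum(w * r for w, r in zip(weights, reports)) / total
```

For example, `update_belief([1.0, 3.0], [1.0, 1.0])` averages the two reports, while giving a reporter zero weight removes its influence entirely.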

Notes on Model Development

  • Intuition Behind the Model:
    • Agents with adversarial intentions tend to send more distorted data, which the model aims to identify and filter out.
    • The model could be pre-trained on high-quality synthetic data or on data gathered from actual gameplay, although this may conflict with the goal of minimizing the information provided to the agents.
    • An alternative is to evaluate and train the model iteratively on ongoing gameplay data.
      • This requires retrospective, fairly accurate ground truth about whether each agent is truthful or adversarial.
    • To conserve computational resources, the model could be trained and assessed periodically rather than after every iteration.
      • The primary focus should be on exploring and refining the model.
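One way to sketch the periodic train-and-evaluate idea above (the helper names `should_train` and `training_examples`, and the period of 10 iterations, are assumptions for illustration):

```python
TRAIN_PERIOD = 10  # train every 10th iteration to conserve compute (assumed value)

def should_train(iteration, period=TRAIN_PERIOD):
    """Return True on iterations where the model should be trained and evaluated."""
    return iteration > 0 and iteration % period == 0

def training_examples(recent_comms, ground_truth):
    """Pair each agent's (2 x N) recent messages with its retrospective
    label (1 = truthful, 0 = adversarial) to form supervised examples."""
    return [(recent_comms[agent], ground_truth[agent]) for agent in recent_comms]
```

The retrospective labels would come from whatever ground truth the game can reconstruct after the fact, which is the prerequisite noted above.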
