Evaluation Benchmark Details #1

praeclarumjj3 · 2024-03-24T03:43:51Z

Hi, thanks for your work!

Do you plan to release the code and data used during the evaluation, particularly the question-answer pairs for the Q&A, ground truths for event summarization, and the multimodal dialogue generation tasks?

I looked at the LoCoMo dataset release, and it only contains the JSONs corresponding to the 50 conversations. Please let me know if I missed something in those JSONs.

yhshu · 2024-05-03T14:56:53Z

Thank the authors for the great work! I'd also like to ask if you have plans to release the full benchmark.

LeonNerd · 2024-06-06T10:06:19Z

+1

lightislost · 2024-06-12T03:03:23Z

+1

deadpool66 · 2024-06-18T11:43:29Z

+1

adymaharana · 2024-08-02T16:49:39Z

Hi everyone,

Thank you so much for your patience! We are happy to know that our work has been of interest. We have released our dataset with annotations; please see data/locomo10.zip in this repository for the evaluation benchmark that is released as part of the ACL 2024 version of our paper. We sub-sampled our previous release of 50 conversations to retain the longest conversations (see details in Note). We will be updating the Arxiv paper with the results on this subset in the following week and also releasing the code for evaluating open-source and closed-source LLMs on all tasks in LoCoMo. Please let me know if you face any issues or discrepancies in the data (and the code release in the following week).

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Evaluation Benchmark Details #1

Evaluation Benchmark Details #1

praeclarumjj3 commented Mar 24, 2024

yhshu commented May 3, 2024

LeonNerd commented Jun 6, 2024

lightislost commented Jun 12, 2024

deadpool66 commented Jun 18, 2024

adymaharana commented Aug 2, 2024

Evaluation Benchmark Details #1

Evaluation Benchmark Details #1

Comments

praeclarumjj3 commented Mar 24, 2024

yhshu commented May 3, 2024

LeonNerd commented Jun 6, 2024

lightislost commented Jun 12, 2024

deadpool66 commented Jun 18, 2024

adymaharana commented Aug 2, 2024