国产人妻人伦精品_欧美一区二区三区图_亚洲欧洲久久_日韩美女av在线免费观看

合肥生活安徽新聞合肥交通合肥房產生活服務合肥教育合肥招聘合肥旅游文化藝術合肥美食合肥地圖合肥社保合肥醫院企業服務合肥法律

代做CSOCMP5328、代寫Python編程設計

時間:2024-05-19  來源:合肥網hfw.cc  作者:hfw.cc 我要糾錯



CSOCMP5**8 - Advanced Machine Learning 
Bias and Fairness in Large Language Models (LLMs) 
 
This is a group assignment, 2 to 3 students only. This is NOT an individual assignment. It is worth 
25% of your total mark. 
 
1. Introduction 
Generative AI models have garnered significant attention and adoption in various domains due to 
their remarkable output quality. Nevertheless, these models, reliant on massive, internet-sourced 
datasets, exhibit vulnerabilities that sparked a debate on important ethical concerns, especially 
around fairness, pertaining to the amplification of human biases and a potential decline in 
trustworthiness. 
 
This assignment aims to investigate methods for bias mitigation within generative AI models and 
provide your own method to mitigate the bias in the LLMs. While there are two main critical areas: 
Text-to-Text and Text-to-Image where fairness is paramount, our focus in this assignment is 
specifically on the Text-to-Text problem. 
● Text-to-Text using Large Language Models (LLMs): This area encompasses prominent 
language models such as Llama-2, BERT, T5, GPT-2/3, and Chat-GPT, and examines the 
potential for these models to generate biased textual content and its implications. 
1.1 Common biased categories 
To contextualise our investigation, we have identified several common categories of bias that 
may manifest within generative AI models: 
● Gender and Occupations: One significant aspect involves exploring biases related to 
gender disparities in various professions. By analysing the output of generative models, we 
can discern whether these models tend to associate specific careers more with one gender 
over another, thus potentially perpetuating occupational stereotypes, for example: 
○ Text-to-Text: GPT-2 may generate text that reinforces traditional gender 
stereotypes. For example, it might associate caregiving with women and leadership 
with men, perpetuating societal biases. Example: "She is a nurturing mother, 
always putting her family first." 
○ Text-to-Image: The results generated by Stable Diffusion for the prompt “A photo 
of a firefighter.”  
 
● Race / Ethnicity: Another critical dimension involves assessing biases related to race and 
ethnicity: 
○ Text-to-Text: GPT-2 may generate text that perpetuates racial stereotypes or 
generalisations about specific racial or ethnic groups, for example: "Asian people 
are naturally good at math." or the model may generate content that oversimplifies 
or misrepresents the cultures and traditions of certain racial or ethnic groups. for 
example: "All Latinos are passionate dancers." 
○ Text-to-Image: The bias results for “intelligent person” using Image Search 
Engines. 
 
 
Addressing bias and fairness in generative AI represents a complex and ongoing challenge. 
Researchers and developers are actively engaged in devising a range of techniques aimed at bias 
detection and mitigation. These approaches include the diversification of training data sources, the 
development of ethical guidelines for AI development, and the creation of algorithms designed 
explicitly to identify and rectify bias within AI-generated outputs. 
1.2 Safety 
Generative AI is used in intentionally harmful ways. This includes misusing generative AI to 
generate child sexual exploitation and abuse material based on images of children, or generating 
sexual content that appears to show a real adult and then blackmailing them by threatening to 
distribute it over the internet. Generative AI can also be used to manipulate and abuse people by 
impersonating human conversation convincingly and responding in a highly personalised manner, 
often resembling genuine human responses. 
Note: The resultant figures from Stable Diffusion are only presented to demonstrate the bias. This 
assignment is only for "text-based bias and fairness" in LLMs. 
 
2. A Guide to Using the Datasets 
To effectively investigate and assess bias within generative AI models for Text-to-Text, it is crucial 
to select appropriate datasets that reflect real-world scenarios and challenges. Depending on your 
chosen focus, you may need to find specific datasets for your area of investigation e.g., healthcare, 
sports, entertainment datasets etc. We provide some examples below however you are free to choose any dataset not listed. There are several datasets used for LLM bias evaluation [1], you 
may refer to this link for more information: https://github.com/i-gallegos/Fair-LLM-Benchmark. 
Those datasets are only used for evaluation, do not train your model with these datasets. 
 
Depending on your research objectives, select training datasets that align with your area of 
investigation. 
● Access the chosen datasets through official sources, research papers, or relevant 
repositories. 
● Download the training dataset (s) to your local environment. Ensure that you adhere to any 
licensing or usage terms associated with the dataset(s). Depending on the debiasing 
techniques employed, retraining the model may be necessary. Commonly utilised datasets 
for training LLMs such as Common Crawl, Wikipedia, BookCorpus, PubMed, arXiv, 
ImageNet, COCO, VQA, Flickr30k, etc. 
● Pre-process the dataset as necessary for compatibility with your chosen de-biasing (i.e., 
enabling fairness) methods in generative AI model. Consider factors like label imbalance 
among various demographic groups in the training data, as this can lead to bias. One 
common method for addressing bias is counterfactual data augmentation (CDA) [1] to 
balance labels. Additionally, other pre-processing techniques involve adjusting harmful 
information in the data or eliminating potentially biased texts. Identify and handle harmful 
text subsets using different methods to ensure a fairer training corpus. 
● Integrate the pre-processed dataset(s) into your code for training and evaluation. Ensure 
that you have the appropriate data loading and pre-processing routines in place to work 
seamlessly with generative AI models. 
 
Remember that data pre-processing and formatting are crucial steps in ensuring that the datasets 
are ready for input into your generative AI models. Additionally, make sure to document your 
dataset selection and pre-processing steps thoroughly in your research report for transparency and 
reproducibility. 
 
3. Performance Evaluations 
Most fairness metrics for LLMs can be categorised by what they use from the model such as the 
embeddings, probabilities, or generated text, including: 
● Embedding-based metrics: Using the dense vector representations to measure bias, which 
are typically contextual sentence embeddings. 
● Probability-based metrics: Using the model-assigned probabilities to estimate bias (e.g., to 
score text pairs or answer multiple-choice questions). 
● Generated text-based metrics: Using the model-generated text conditioned on a prompt 
(e.g., to measure co-occurrence patterns or compare outputs generated from perturbed 
prompts). 
 
 
 4. Tasks 
Your main tasks are: 
 
● Research: Conduct in-depth research to identify various methods for addressing bias in 
Generative AI. Ensure you understand the theoretical foundations and practical 
implementation of these methods. Provide comprehensive comparison of various methods 
based on the conducted evaluations and discuss their contributions, evaluation methods, 
strengths, and weaknesses (this will help in the Related Work section of the report). 
 
● Proposed Mathematical Model: 
○ Chose a language model such as Llama-2, BERT, T5, GPT-2/3, and Chat-GPT you 
would like to remove the bias. Write mathematical model for your proposed 
approach, represent training datasets as a database or feature sets etc., preprocessing
 steps that you have taken on the training datasets, the objective and 
optimisation method that you employed, training model using LLM, and evaluation 
metrics to evaluate your model. Write comprehensive table to show all the notations 
along with their descriptions. 
○ Write algorithms to show all the steps of the proposed approach, including system 
initialisation, training/testing, bias evaluations, results evolutions, or any other 
steps that show the implementation of your proposed approach. 
○ Show schematic representation of your proposed approach. 
● Code Development: 
○ Implement the selected bias mitigation methods, based on the proposed 
mathematical model. 
○ Train the model using selected LLM with the pre-processed dataset (if needed). 
○ Evaluate the bias, show experimental evaluations of various metrics, generate their 
corresponding figures. 
○ The code (including interfacing for training model using LLM and results 
evaluations) must be written in Python 3. You are allowed to use any external 
libraries for performance comparisons; however, you need to provide details on 
how the libraries were setup and how evaluation metrics were used, in the Appendix 
section. 
 
● Evaluation: 
○ Perform the chosen model before applying debiasing techniques on evaluation 
datasets and show if the bias exists via various prompts, these results are termed as 
the baseline. 
○ Pre-process the dataset and train the model using LLM using your proposed 
method. Evaluate the performance of the trained model via various prompts to 
demonstrate that you have addressed the bias. Note that, some debiasing techniques 
may not require retraining the model. 
○ Compare the performance of proposed method with the baseline. 
○ Evaluate other performance evaluation metrics, e.g., utility, training time, average, 
St. Dev etc. Note that some of the evaluation metrics might not be applicable in 
your proposed scenario, hence, you must actively think of various evaluation 
metrics to determine the applicability of your model; comprehensive literature survey will help you find how authors evaluated the bias and enabled fairness of 
generative AI models. 
○ Important: Please note that this is our understanding of how to carry out this study 
and evaluations i.e., show bias of chosen model via prompts à apply chosen 
debiasing technique (for example, pre-process the dataset (to remove imbalance 
labels and re-train model with pre-processed dataset) à via prompts, show that you 
have addressed the bias à compare baseline with proposed approach. If you think 
that this might not work, you need to come up with other techniques. 
 
● Conclude: 
○ Conclude your findings and show the strengths and weaknesses of your proposed 
approach. 
○ Provide hypothetical comparison of your approach with other approaches in the 
literature. This comparison could be based on various performance metrics. 
○ Provide future research directions about how to mitigate those weaknesses. 
○ Provide comprehensive directions on how your proposed model could be 
generalised and applicable for various application scenarios e.g., social media 
applications, stock markets, health or sports analytics etc. 
 
Note: Above steps are written with quite details. If you still have any ambiguity about those steps 
or implementation/technical questions or understanding of the problem scenario, then please do 
your own research, share your findings on the Ed so that other students could also get idea of how 
to deal with specific problem steps. Furthermore, please also post your concerns/questions no Ed 
under the “Assignment 2” thread, our teaching team will be happy to share their experience and 
suggestions. Please note that this is an open research assignment, use your own creativity and come 
up with the understanding of this problem scenario and solution. 
 
4.1 Report 
The report should be organised similar to research papers, and should contain at least the following 
sections: 
 
Abstract: 
• Clearly introduces the topic scenario and its significance. 
• Provides a concise summary of the proposed evaluation method. 
• Provide the results from various evaluation metrics. 
• Conclude your contributions and discuss its applicability in the real-world scenario. 
 
Introduction: 
• Clearly introduces the problem of bias in generative AI and its importance. 
• Provides a clear and detailed overview of the proposed methods. 
• Write contributions in detail e.g., pre-processing, experimental setup, mathematical 
model, proposed evaluation method and metrics, various steps to achieve evaluate your 
results. 
• Provide discussion on the key results and show the organisation of your report at the end 
of this section. 
 Related Work: 
• Provides a comprehensive review of related debiasing and fairness methods. 
• Discusses the advantages and disadvantages of the reviewed methods in the literature. 
• Demonstrates understanding of the existing literature. 
• Provide a summarised table of the existing works and show their contributions, evaluation 
method, strengths, and weaknesses of existing work. 
 
Proposed Method: 
• Explains the theoretical foundations of the proposed solution effectively. 
• Describes the details of debiasing methods clearly, including the objective function. 
• Presents the algorithmic representation of the proposed solution comprehensively. 
• Show schematic representation of your proposed approach. 
 
Experiments/Evaluations: 
• Provides a clear description of the experimental setup, including datasets, algorithm 
evaluations, and metrics. 
• Presents experimental results effectively, with appropriate figures. 
• Conducts a thorough analysis and comparison of baseline and proposed method. 
• Provides detailed insights on the results. 
 
Conclusion: 
• Effectively summarises the methods and results. 
• Provides valuable insights or suggestions for future work. 
• Provide strengths and weaknesses of your work, furthermore, provide future directions. 
 
References: 
• Lists all references, cited in the report. 
• Formats all references consistently and correctly. 
 
Appendix: 
• Provide instructions on how to run your code. 
• Provide additional/supporting figures or experimental evaluations. 
 
Note: Please follow the provided latex format for the report on Canvas. 
 
5. Submission guidelines 
1. Go to Canvas and upload the following files/folders compressed together as a zip file. 
● Report (a PDF file) 
The report should include all member’s details (student IDs and names). 
● Code (a folder): 
○ Algorithm (a sub-folder): Your code (could be multiple files or a project) ○ Input data (a sub-folder) Empty. Please do NOT include the dataset in the zip file 
as they are large. Please provide detailed instructions on how the datasets are used 
and how to download them. We will copy the dataset to the input folder when we 
test the code. 
2. A plagiarism checker will be used, both for code and report. 
3. A penalty of MINUS 20 percent marks (−20%) per day after the due date. The maximum 
delay is 5 (five) days, after that assignments will not be accepted. 
 
Note: Only one student needs to submit the zip file which must be renamed as student ID numbers 
of all group members separated by underscores, which should contain all the relevant files and 
report. E.g., “xxxxxxxx_xxxxxxxx_xxxxxxxx.zip”. Please write names and email addresses of 
each member in the report. 
 
 
Example References: 
1. Bias and Fairness in Large Language Models: A Survey. Isabel O. Gallegos, Ryan A. 
Rossi, Joe Barrow, Md Mehrab Tanjim, Sungchul Kim, Franck Dernoncourt, Tong Yu, 
Ruiyi Zhang, Nesreen K. Ahmed. https://arxiv.org/abs/2309.00770 
2. A Survey on Fairness in Large Language Models. Yingji Li, Mengnan Du, Rui Song, Xin 
Wang, Ying Wang. https://arxiv.org/abs/2308.10149 
3. Fair Diffusion: Instructing Text-to-Image Generation Models on Fairness. Felix Friedrich, 
Manuel Brack, Lukas Struppek, Dominik Hintersdorf, Patrick Schramowski, Sasha 
Luccioni, Kristian Kersting. https://arxiv.org/abs/2302.10893 
4. Stable Bias: Analyzing Societal Representations in Diffusion Models. Alexandra Sasha 
Luccioni, Christopher Akiki, Margaret Mitchell, Yacine Jernite. 
https://arxiv.org/abs/2303.11408 
 
 6. Marking Rubrics 
Criterion Marks Comments 
 
Coding (30 Marks): 
• Coding will be run to see whether it works properly and 
produces the figures and all evaluations demonstrated in 
the report. 
 
Abstract (5 Marks): 
• Clearly introduces the topic scenario and its 
significance. (1 Marks) 
• Provides a concise summary of the proposed evaluation 
method. (2 Marks) 
• Provide the results from various evaluation metrics. (1 
Marks) 
• Conclude your contributions and discuss its 
applicability in the real-world scenario. (1 Marks) 
 
Introduction (10 Marks): 
• Clearly introduces the problem of bias in generative AI 
and its importance. (3 Marks) 
• Provides a clear and detailed overview of the proposed 
methods. (3 Marks) 
• Write contributions in detail e.g., pre-processing, 
experimental setup, mathematical model, proposed 
evaluation method and metrics, various steps to achieve 
evaluate your results. (2 Marks) 
• Provide discussion on the key results and show the 
organisation of your report at the end of this section. (2 
Marks) 
 
Related Work (10 Marks): 
• Provides a comprehensive review of related debiasing 
and fairness methods. (3 Marks) 
• Discusses the advantages and disadvantages of the 
reviewed methods in the literature. (3 Marks) 
• Demonstrates understanding of the existing literature. (2 
Marks) 
• Provide a summarised table of the existing works and 
show their contributions, evaluation method, strengths, 
and weaknesses of existing work. (2 Marks) 
 
 
 
  
Proposed Method (20 Marks): 
• Explains the theoretical foundations of the proposed 
solution effectively. (7 Marks) 
• Describes the details of debiasing methods clearly, 
including the objective function. (4 Marks) 
• Presents the algorithmic representation of the proposed 
solution comprehensively. (7 Marks) 
• Shows schematic representation of proposed approach. 
(2 Marks) 
 
Experiments/Evaluations (20 Marks): 
• Provides a clear description of the experimental setup, 
including datasets, algorithm evaluations, and metrics. 
(7 Marks) 
• Presents experimental results effectively, with 
appropriate figures. (7 Marks) 
• Conducts a thorough analysis and comparison of 
baseline and proposed method. (4 Marks) 
• Provides detailed insights on the results. (4 Marks) 
 
Conclusion (5 Marks): 
• Effectively summarises the methods and results. (1 
Marks) 
• Provides valuable insights or suggestions for future 
work. (2 Marks) 
• Provide strengths and weaknesses of your work, 
furthermore, provide future directions. (2 Marks) 
 
References: 
• Lists all references, cited in the report. 
• Formats all references consistently and correctly. 
 
Overall Presentation (10 Marks): 
• Maintains a clear and logical structure throughout the 
report. (5 Marks) 
• Demonstrates excellent writing quality, including clarity 
and coherence. (3 Marks) 
• Adheres to formatting and citation guidelines 
consistently. (2 Marks) 
 
Total: 100 Marks 


 請加QQ:99515681  郵箱:99515681@qq.com   WX:codinghelp









 

掃一掃在手機打開當前頁
  • 上一篇:菲律賓移民北美的條件(移民材料是什么)
  • 下一篇:代做CSC 4120、代寫Python程序語言
  • 無相關信息
    合肥生活資訊

    合肥圖文信息
    流體仿真外包多少錢_專業CFD分析代做_友商科技CAE仿真
    流體仿真外包多少錢_專業CFD分析代做_友商科
    CAE仿真分析代做公司 CFD流體仿真服務 管路流場仿真外包
    CAE仿真分析代做公司 CFD流體仿真服務 管路
    流體CFD仿真分析_代做咨詢服務_Fluent 仿真技術服務
    流體CFD仿真分析_代做咨詢服務_Fluent 仿真
    結構仿真分析服務_CAE代做咨詢外包_剛強度疲勞振動
    結構仿真分析服務_CAE代做咨詢外包_剛強度疲
    流體cfd仿真分析服務 7類仿真分析代做服務40個行業
    流體cfd仿真分析服務 7類仿真分析代做服務4
    超全面的拼多多電商運營技巧,多多開團助手,多多出評軟件徽y1698861
    超全面的拼多多電商運營技巧,多多開團助手
    CAE有限元仿真分析團隊,2026仿真代做咨詢服務平臺
    CAE有限元仿真分析團隊,2026仿真代做咨詢服
    釘釘簽到打卡位置修改神器,2026怎么修改定位在范圍內
    釘釘簽到打卡位置修改神器,2026怎么修改定
  • 短信驗證碼 豆包網頁版入口 破天一劍 目錄網 排行網

    關于我們 | 打賞支持 | 廣告服務 | 聯系我們 | 網站地圖 | 免責聲明 | 幫助中心 | 友情鏈接 |

    Copyright © 2025 hfw.cc Inc. All Rights Reserved. 合肥網 版權所有
    ICP備06013414號-3 公安備 42010502001045

    国产人妻人伦精品_欧美一区二区三区图_亚洲欧洲久久_日韩美女av在线免费观看
    中文字幕无码精品亚洲资源网久久| 国产精品高清一区二区三区| 久久久国产成人精品| 亚洲尤物视频网| 精品少妇人妻av一区二区| 国产mv久久久| 亚洲va码欧洲m码| 成人精品久久一区二区三区| 国产精品久久久久不卡| 日韩精品 欧美| 国产黄色特级片| 少妇免费毛片久久久久久久久| 成人久久久久久久久| 欧美麻豆久久久久久中文| 任我爽在线视频精品一| 久久人人看视频| 亚洲人成无码www久久久| 国产免费久久av| 欧美激情视频在线免费观看 欧美视频免费一 | 欧美久久久久久久| 久久免费视频在线观看| 亚洲巨乳在线观看| 隔壁老王国产在线精品| 色综合久久悠悠| 成人在线免费观看一区| 亚洲字幕在线观看| www.av毛片| 久久久久久国产精品| 国产欧美精品一区二区三区-老狼| 国产精品久久久久久久av电影| 虎白女粉嫩尤物福利视频| 国产精品视频久久| 国产一区在线播放| 欧美激情精品久久久久久变态| 国产美女永久无遮挡| 这里只有精品66| 91精品国自产在线观看| 春日野结衣av| 国产成一区二区| 日本一二三区视频在线| 久久久av水蜜桃| 人妻夜夜添夜夜无码av| 久久久久久久久久国产精品| 欧美亚洲精品日韩| 久久国产精品99国产精| 国产伦精品一区二区三区免| 亚洲精品国产精品国自产| 国产v片免费观看| 黄色片一级视频| 欧美精品久久久久久久| 久久人人爽人人爽人人av| 欧美在线日韩在线| 国产精品果冻传媒潘| 高清不卡一区二区三区| 色乱码一区二区三在线看| 国产精品视频在线免费观看| 国产日韩欧美中文| 亚洲一区不卡在线| 日韩在线欧美在线| 国产精品自拍网| 欧洲成人在线视频| 色综合视频网站| 久艹在线免费观看| 国产在线日韩在线| 日韩国产精品一区二区| 国产精品成人一区二区三区吃奶| 91精品中文在线| 欧美精品一区在线| 中文精品视频一区二区在线观看| 91免费看蜜桃| 女同一区二区| 亚洲天堂第一区| 日韩在线视频一区| 国产日韩欧美成人| 日本免费一区二区三区视频观看| 久久不射电影网| 国产成人一区二| 国产免费一区二区视频| 日本不卡二区| 一区二区精品免费视频| 久久精品久久久久久| 97精品一区二区三区| 韩国欧美亚洲国产| 亚洲a一级视频| 国产精品成人va在线观看| 国产精品 欧美在线| 国产熟人av一二三区| 性高湖久久久久久久久aaaaa| 国产精品视频白浆免费视频| 91国产精品电影| 国产专区一区二区三区| 日本中文字幕亚洲| 在线视频亚洲自拍| 国产精品精品久久久| 久久精品久久精品国产大片| 粉嫩av一区二区三区天美传媒| 男女午夜激情视频| 日韩啊v在线| 午夜精品久久久内射近拍高清| 久久亚洲一区二区三区四区五区高| 国产第一区电影| 91精品在线影院| 国产在线拍偷自揄拍精品| 日韩欧美黄色大片| 亚洲欧美日韩国产成人综合一二三区| 久久夜色精品亚洲噜噜国产mv| 久久久久久伊人| 国产精品av免费| 国产精品一 二 三| 国产自产精品| 精品一区二区视频| 欧日韩在线观看| 日本乱人伦a精品| 偷拍视频一区二区| 亚洲欧美精品| 亚洲综合中文字幕在线| 久操成人在线视频| 国产精品久久久久久久久影视| 精品国产一区av| 久久久久免费看黄a片app| 国产成人一二三区| 国产精品18久久久久久首页狼| www.亚洲视频.com| 不卡视频一区| 99视频国产精品免费观看| 国产精品一区专区欧美日韩| 国产尤物91| 国产人妻互换一区二区| 国产日韩中文字幕| 国产视频一区二区三区四区| 美女精品国产| 国产伦视频一区二区三区| 国产青青在线视频| 国产尤物av一区二区三区| 国产香蕉一区二区三区| 国产日韩欧美电影在线观看| 国产乱人伦精品一区二区三区 | 国产精品一线二线三线| 古典武侠综合av第一页| 91麻豆国产精品| 久久亚洲免费| 丝袜一区二区三区| 国产精品久久久一区| 欧美成年人在线观看| 国产999在线观看| 亚洲中文字幕无码专区| 日韩影院一区| 欧美在线一区二区视频| 美女日批免费视频| 福利视频一二区| 国产福利成人在线| 久久韩国免费视频| 久久福利网址导航| 亚洲精品视频一区二区三区 | 久久综合五月天| 亚洲综合av影视| 日韩欧美亚洲日产国产| 国内精品小视频在线观看| 麻豆91av| 91免费国产视频| 日韩在线高清视频| 欧美日韩国产二区| 亚洲欧美成人一区| 欧洲精品在线一区| 国产精品一区二区久久久| 久久久久高清| 国产精品福利在线观看网址| 在线观看免费黄色片| 日本国产一区二区三区| 国内精品视频一区二区三区| 国产精品一区二区在线| 久久久免费高清电视剧观看| 久久久精品国产网站| 欧美激情亚洲激情| 日本一级黄视频| 国产一区高清视频| 久久亚洲中文字幕无码| 国产精品久久久久久av福利软件| 亚洲午夜久久久影院伊人| 青青青在线视频播放| 国产精选一区二区| 日韩在线欧美在线| 亚洲一区二区中文| 激情综合网婷婷| 久久在线中文字幕| 久久av在线看| 青青草成人在线| 99精品99久久久久久宅男| 国产成人免费91av在线| 一级一片免费播放| 国内免费久久久久久久久久久| 久久涩涩网站| 伊人久久在线观看| 欧美激情专区| 久久精品在线免费视频| 精品久久久久久无码国产| 欧洲精品在线视频| 国产极品尤物在线| 中文字幕一区二区三区四区五区| 欧美性大战久久久久xxx|