
Date: 2024-02-29  Source: hfw.cc  Author: hfw.cc



CS257 Advanced Computer Architecture
Coursework Assignment
Term 2, 2023/24
Contents
1 Introduction
2 Submission
3 Introduction to ACACGS
4 Compiling and Running the Code
4.1 Visualisation Generation
5 Hardware Details
6 How will my code be tested for performance?
7 Rules
8 Where do I start?
9 Instructions for Submission
10 Support
1 Introduction
The purpose of this coursework is to give you some hands-on experience in code optimisation. By the time you read
this, you will have encountered a variety of code optimisation techniques including loop unrolling and vectorisation.
2 Submission
Your submission will consist of two parts:
1. Optimised Code (60%)
A piece of C code based on the initial implementation provided. This C code will be assessed with respect
to your selection and understanding of optimisations, functional correctness, i.e., producing the right answer,
and execution speed.
2. Written Report (40%)
A report (4 pages maximum, excluding references) detailing your design and implementation decisions. Your
report will be evaluated with respect to your understanding of code optimisation techniques and the optimisations you attempted. This means that your report should explain:
(a) which optimisations you did and did not use;
(b) why your chosen optimisations improve performance; and
(c) how your chosen optimisations affect floating-point correctness.
Given that you may apply many different optimisations, a sensible approach is to build your solution incrementally, saving each partial solution and documenting the impact of each optimisation you make. This means that it
is in your interest to attempt as many different optimisations or combinations of optimisations as you can.
You may discuss optimisation techniques with others but you are not allowed to collaborate on solutions to this
assignment. Please remember that the University takes all forms of plagiarism seriously.
3 Introduction to ACACGS
ACACGS is a conjugate gradient proxy application for a 3D mesh. The simulation executes either for a fixed
number of timesteps or until the residual value falls below a given threshold. This is done for a given
mesh size, which is passed in at runtime through command-line arguments.
In this proxy application, a force is applied to each edge boundary of the cuboid and is then propagated
throughout the mesh. As each timestep passes, the force is dissipated within the mesh, until either the residual
is small enough that the simulation stops (as there are no more calculations to perform) or a set number of
timesteps have passed.
In addition to providing numeric solutions, the code can also generate visuals which depict the pressure within
the mesh throughout the simulation run. Creating the visualisations relies on two optional packages, Silo and VisIt,
which are available on the DCS systems.
Figure 1: Pressure Matrix Visualisation
4 Compiling and Running the Code
The code includes a Makefile to build the program. You can compile all of the code using the command make.
You should not modify the Makefile, but examining it may prove helpful in some situations.
While the DCS machines do include a version of gcc, it is preferable to use a more recent version. On the DCS
systems, you can make version 9 the default by using the module load gcc9 command. Once this is loaded you
can simply type make to build the code, which will create an executable named acacgs in the directory. To clean
up the directory, you can run make clean.
To run the code, you need to provide the three dimensions of the mesh as three parameters to the executable.
For example, to execute the provided code on a small 10x10x10 mesh you would enter ./acacgs 10 10 10. On my
system, the code produces the output below. This information is also stored in a file named after the wallclock
date and time at which the program was started (for example, 2023_01_26_12_00_00.txt).
===== Final Statistics =====
Executable name: ./acacgs
Dimensions: 10 10 10
Number of iterations: 149
Final residual: 2.226719e-92
=== Time ==
Total: 1.126600e-02 seconds
ddot Kernel: 8.390000e-04 seconds
waxpby Kernel: 1.087000e-03 seconds
sparsemv Kernel: 9.123000e-03 seconds
=== FLOP ==
Total: 9.536000e+06 floating point operations
ddot Kernel: 5.960000e+05 floating point operations
waxpby Kernel: 8.940000e+05 floating point operations
sparsemv Kernel: 8.046000e+06 floating point operations
=== MFLOP/s ==
Total: 8.464406e+02 MFLOP/s
ddot Kernel: 7.103695e+02 MFLOP/s
waxpby Kernel: 8.224471e+02 MFLOP/s
sparsemv Kernel: 8.819467e+02 MFLOP/s
Difference between computed and exact = 1.110223e-15
You will find more detailed instructions to build the code in the README.md file, including flags to turn on
verbose mode, which will output details for each timestep in the simulation, and flags for enabling visualisation.
4.1 Visualisation Generation
To enable visualisation outputs, you must build your code using make SILO=1. This will then compile your code
in a way which produces files suitable for visualisation in VisIt. If you are working remotely and want to visualise
the coursework, it will be quicker and easier for you to copy the files to your local machine, then utilise VisIt on
the local machine to visualise the cuboid. Before you make the program, make sure you load the SILO module
(module load cs257-silo).
When the program is run with visualisations, each timestep will produce a SILO file within a directory named
after the wallclock date and time (for example: 2023_01_26_12_00_00). In this directory will be a collection of
.silo files, each named outputXXXX.silo, where XXXX represents the timestep it relates to.
Once the program has finished, these can be opened in VisIt. To do so, load the VisIt module (module load
cs257-visit) and open VisIt using the command visit. From here, you will get two windows. The smaller, skinnier
one is the control window and is used to manage everything that will be displayed. The larger window is the display
window. In the control window, select Open, and navigate to the directory with the SILO files. You should then
be able to select these SILO files.
Now that the SILO files have been loaded, we can draw some of the variables. To do this, click Add
and select a mode and a variable to view. One of the nicest combinations is Volume with either x_nodal
or p_nodal. When you have finished adding elements, click Draw. This will generate an image in the display
window, which can be dragged around so that the cuboid can be viewed from different angles. The control window
has a play button, which will run through each timestep.
Visualisations are nice to have, but for performance purposes we turn them off as they write a significant amount
of data to disk.
Table 1: Visualisation Data File Sizes
x    y    z    Cells      Approximate Data Size
10   10   10   1,000      4MB
25   25   25   15,625     39MB
50   50   50   125,000    301MB
100  100  100  1,000,000  2.4GB
200  200  200  8,000,000  19.3GB
There is the potential to go significantly over your DCS disk quota with large meshes. I recommend that you
do not exceed 30x30x30 for producing visualisations on the DCS machines. If you are developing your solution on
your personal machine then you may wish to produce larger visualisations.
5 Hardware Details
On a Linux system, you can read the processor information using the command cat /proc/cpuinfo or lscpu.
This will provide full details on the CPU in the machine, including the CPU model, number of cores, the clock
frequency and supported extensions. I strongly recommend taking a look at this on your development machine.
For the purposes of assessment, your code will be run on a DCS machine with 4 cores. The output from lscpu
can be seen below:
Architecture: x86_64
CPU op-mode(s): 32-bit, 64-bit
Byte Order: Little Endian
CPU(s): 4
On-line CPU(s) list: 0-3
Thread(s) per core: 1
Core(s) per socket: 4
Socket(s): 1
NUMA node(s): 1
Vendor ID: GenuineIntel
CPU family: 6
Model: 158
Model name: Intel(R) Core(TM) i5-7500 CPU @ 3.40GHz
Stepping: 9
CPU MHz: 3400.000
CPU max MHz: 3800.0000
CPU min MHz: 800.0000
BogoMIPS: 6816.00
Virtualization: VT-x
L1d cache: 32K
L1i cache: 32K
L2 cache: 256K
L3 cache: 6144K
NUMA node0 CPU(s): 0-3
Flags: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36
clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe syscall nx pdpe1gb rdtscp lm
constant_tsc art arch_perfmon pebs bts rep_good nopl xtopology nonstop_tsc cpuid
aperfmperf tsc_known_freq pni pclmulqdq dtes64 monitor ds_cpl vmx smx est tm2 ssse3
sdbg fma cx16 xtpr pdcm pcid sse4_1 sse4_2 x2apic movbe popcnt tsc_deadline_timer aes
xsave avx f16c rdrand lahf_lm abm 3dnowprefetch cpuid_fault invpcid_single pti ssbd
ibrs ibpb stibp tpr_shadow vnmi flexpriority ept vpid ept_ad fsgsbase tsc_adjust bmi1
avx2 smep bmi2 erms invpcid mpx rdseed adx smap clflushopt intel_pt xsaveopt xsavec
xgetbv1 xsaves dtherm ida arat pln pts hwp hwp_notify hwp_act_window hwp_epp md_clear
flush_l1d arch_capabilities
Machines matching this specification are available in the cs257 queue of the Batch Compute System in the
Department (referred to as kudu in the labs). You will learn how to use this system during the lab sessions, so
there will be time to get used to it.
6 How will my code be tested for performance?
Your submission will be tested on a range of input sizes to evaluate how robust your performance improvements
are. It is recommended that you try testing your solution on inputs that are not cubes to see if there are any
weaknesses in your optimisation strategies. The 7-pt stencil option will not be used for testing your code.
Your code will be executed five times for each problem size on the target hardware. The highest and lowest
runtimes will be discarded, and the mean of the three remaining values will be taken as your runtime for that
problem size.
7 Rules
Your submitted solution must:
• Compile on the DCS workstations.
Your submitted solution must not:
• Alter the Makefile or add or edit any compiler flags;
• Use instruction sets not supported by the DCS machines;
• Require additional hardware e.g., GPUs;
• Add relaxed math options to the compile line, e.g., -ffast-math. Note: Manual use of approximate math
functions is acceptable.
8 Where do I start?
This can seem like a daunting project, but we can break it down into a number of steps.
1. Compile and run the code as provided. This is a quick and easy check to make sure your environment is set up
correctly.
2. Read the code. Start in main.c and follow it through. The functions are well documented with Doxygen
comments. Don’t panic - you are not expected to understand the physics in the code.
3. Measure the runtime of the code for reference purposes.
4. Figure out where the most intensive sections of code are.
5. Develop a small optimisation.
6. Run the code and review the impact of your changes.
7. Repeat steps 5 and 6 until you have exhausted your performance ideas.
9 Instructions for Submission
Your solution should be submitted using Tabula. Please ensure that your code works on DCS machines prior to
submission.
Submission Deadline: Wednesday 20th March 2024 @ 12 Noon
Files Required: A single file named coursework.zip which should contain all of your code at the top-level (i.e.
no subdirectories) and the report file as a PDF. All files should be submitted through Tabula.
10 Support
Support is available from your Teaching Assistants, Stephen Xu (stephen.xu@warwick.ac.uk) and James
Macer-Wright (james.macer-wright@warwick.ac.uk), or from the module organiser via email.