[Doc] update profiling.md #70

Open

linuxholic wants to merge 1 commit into microsoft:main from linuxholic:patch-1

docs/profiling.md

@@ Expand Up @@
     ## Network (Collectives) profiling
-    Network profiling is not dependent on the model 🎉. So, we can use the same network profiling data for all models. However, we need to ensure that the network profiling data is available for the node configuration we are using. If not, then we need to profile the network for the device. 1.
+    Network profiling is not dependent on the model 🎉. So, we can use the same network profiling data for all models. However, we need to ensure that the network profiling data is available for the node configuration we are using. If not, then we need to profile the network for the device.
     For network profiling, the node setup i.e. type of connectivity between the gpus matter. This is why we have the concept of `network_device`. The network_device is an informal name for the network configuration of the node. Eg: `a100_pair_nvlink`, `a100_dgx`, `h100_dgx` etc.
-. For tensor parallelism, 4 GPUs are needed for TP4 and 8 GPUs are needed for TP8 etc.
-. For pipeline parallelism across nodes, 2 nodes are needed to profile the link between the nodes.
+. For tensor parallelism, 4 GPUs are needed for TP4 and 8 GPUs are needed for TP8 etc.
+. For pipeline parallelism across nodes, 2 nodes are needed to profile the link between the nodes.
     Currently available data include:
@@ Expand Down @@
