72 changes: 72 additions & 0 deletions docs/hardware-requirements.md
@@ -53,8 +53,80 @@ merged upstream.

### Known Issues

### illumos

There are a few known hardware-related issues with illumos.

- Intel CPUs have had several issues related to their C-states.
  SmartOS has worked around them, but you should still consider
  disabling C-states in your BIOS.

### SmartOS and Triton Datacenter

- SmartOS depends on the hardware BIOS serial number to generate a
  system UUID on boot. You can display this UUID with the `sysinfo`
  command. In rare cases, such as with Dell PowerEdge C6100
  blade-type servers, the main chassis incorrectly assigns the same
  serial number to every blade installed in the unit. This can affect
  software such as Triton Datacenter (SDC): to detect a new compute
  node prior to setup, Triton relies on the UUID that SmartOS derives
  from the chassis serial number being unique.

### Example Scenario

With Triton Datacenter and the aforementioned Dell C6100 chassis, one
compute node (blade/sled) is properly detected on boot by "cnapi", and
consequently by the Operator Portal, while the other three quietly PXE
boot without being detected by Triton. To determine whether a
duplicate server UUID is the cause of your issue, SSH into each of the
compute nodes in question and run `sysinfo | json UUID`. If more than
one compute node shares the same UUID, a duplicate serial number is
likely the cause. You can also check the serial numbers on each node
with the following:

`ipmitool fru print 0`

You should receive output resembling this:

    Chassis Type          : Rack Mount Chassis
    Chassis Part Number   :
    Chassis Serial        :
    Board Mfg Date        : Wed Nov 7 02:43:00 2012
    Board Mfg             : Dell Inc.
    Board Product         : PowerEdge
    Board Serial          : CN0D61XP747512B60255A08
    Board Part Number     : 282BNP0616
    Product Manufacturer  : Dell Inc.
    Product Name          : C6100
    Product Part Number   :
    Product Version       :
    Product Serial        : DB3KYV1
    Product Asset Tag     :
Comment on lines +92 to +105

Contributor: This block needs to be further indented to be a code block.

Author: It must be something with the browser cache when reviewing with 'make serve'. They were all code blocks, but refreshes would break the formatting. I'll put them back in.
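
When several compute nodes are involved, comparing the `sysinfo | json UUID` output by eye gets tedious. The following is a minimal sketch, not part of the original change, of one way to spot duplicates. The function name `find_duplicate_uuids`, the host names `cn1` to `cn4`, and the `root` login are all assumptions for illustration.

```shell
#!/bin/sh
# Sketch only: reads "hostname uuid" pairs on stdin and prints each UUID
# that is reported by more than one host, followed by those hosts.
# find_duplicate_uuids is a hypothetical helper, not a SmartOS or Triton tool.
find_duplicate_uuids() {
    awk '
        { count[$2]++; hosts[$2] = hosts[$2] " " $1 }
        END {
            for (uuid in count)
                if (count[uuid] > 1)
                    print uuid ":" hosts[uuid]
        }
    ' | sort
}

# Hypothetical usage (host names and the root login are placeholders):
# for host in cn1 cn2 cn3 cn4; do
#     printf '%s %s\n' "$host" "$(ssh "root@$host" 'sysinfo | json UUID')"
# done | find_duplicate_uuids
```

Any line this prints names a UUID shared by two or more nodes. No output means every node reported a distinct UUID.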


To work around the issue, set a unique serial number for each compute
node using `ipmitool`. SmartOS compute nodes come with `ipmitool`
preinstalled, so the steps are:

- SSH to each affected compute node.
- On your local desktop, randomly generate a new serial number that is
  as unique as possible. In my scenario I simply used `pwgen` on my Mac
  to generate a random 7-character alphanumeric string, but you can
  probably use `/dev/urandom`, Python, OpenSSL, or a myriad of other
  tools to achieve the same result.
  `pwgen -sB 7 1`
- On each node, run the following three commands:

        ipmitool fru edit 0 field c 1 <NEW_SERIAL>
        ipmitool fru edit 0 field b 2 <NEW_SERIAL>
        ipmitool fru edit 0 field p 4 <NEW_SERIAL>

_Note: To update the serial number, each of the above commands
must be executed on each compute node, using that node's own new
serial number._

- Double-check that the new serial number has been set:

        ipmitool fru print 0

- Reboot the compute node.

The compute node should now be properly detected by Triton.
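
If `pwgen` is not available, the steps above can be sketched with standard tools instead. This is only an illustration under stated assumptions: `gen_serial` and `set_fru_serials` are hypothetical helper names, the `IPMITOOL` variable is an invented override for dry runs, and the character set (uppercase letters and digits) is a guess at what a serial should look like, not a requirement taken from the hardware.

```shell
#!/bin/sh
# Sketch only: both functions are hypothetical helpers.

# Print one random serial of the requested length (default 7) drawn from
# uppercase letters and digits. Reads a bounded chunk of /dev/urandom,
# so it is only suitable for short strings.
gen_serial() {
    len="${1:-7}"
    head -c 4096 /dev/urandom | LC_ALL=C tr -dc 'A-Z0-9' | cut -c "1-$len"
}

# Run the three "fru edit" commands from the steps above with one serial.
# Setting IPMITOOL=echo (an assumption of this sketch) prints the commands
# instead of running them.
set_fru_serials() {
    serial="$1"
    if [ -z "$serial" ]; then
        echo "usage: set_fru_serials <NEW_SERIAL>" >&2
        return 1
    fi
    # "c 1", "b 2", and "p 4" are the section/index pairs used above.
    for field in "c 1" "b 2" "p 4"; do
        # $field is left unquoted on purpose so it splits into two arguments.
        ${IPMITOOL:-ipmitool} fru edit 0 field $field "$serial" || return 1
    done
}

# Hypothetical usage: generate the serial on your desktop, then on the node:
# set_fru_serials "<NEW_SERIAL>"
```

A dry run with `IPMITOOL=echo set_fru_serials ABC1234` shows exactly which commands would be executed before anything is written to the FRU.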