vmpooler

mirror of https://github.com/puppetlabs/vmpooler.git synced 2026-05-31 15:15:20 -04:00

Author	SHA1	Message	Date
kirby@puppetlabs.com	3c856d7ae9	(POOLER-133) Identify when a ready VM has failed This commit fixes checking of a VM that has already been identified as ready. Without this change a ready VM that has failed will be identified as having failed, but will not successfully be removed from the ready queue. Additionally, the default vm_checktime value has been reduced from 15 to 1 to ensure that ready VMs are checked within one minute of the time they have reached the ready state by default. Lastly, the docker-compose files are updated to specify that the redis instance used is a local redis instance.	2018-12-03 12:21:08 -08:00
kirby@puppetlabs.com	b1586393c7	Document backend_weight setting	2018-09-13 09:32:20 -07:00
Samuel Beaulieu	3aac95d5b9	(POOLER-114) Refactor check_pool in pool_manager This commit refactorss the check_pool method in pool_manager. Specifically, each commented section describing a stage of check_pool is broken out into a separate method and check_pool is simplified by calling these methods. Without this change it is difficult to follow the intent for or make changes to check_pool. Additionally, a docker-compose file is added to make it simple to launch an all-in-one vmpooler instance along with a separate redis server with docker.	2018-08-23 14:04:52 -07:00
kirby@puppetlabs.com	4feb9c62a2	(doc) Document config via environment This commit adds documentation for specifying vmpooler configuration via environment variables. LDAP server is corrected to LDAP host, and the capability to specify graphite prefix and port is added. Vagrant documentation is moved into its own file to reduce noise in the main readme. Lastly, docker usage is updated to reflect that you no longer bake in a configuration file, and that API and manager can be run separately from docker via its entrypoint.	2018-07-20 15:30:54 -07:00
Steve Axthelm	c3d90dbfa6	(maint) change domain to example.com Prior to this commit the docs and examples used 'company.com' (a real domain). This commit changes those occurrences to 'example.com', which is a IANA-managed reserved domain.	2018-07-19 15:23:30 -07:00
kirby@puppetlabs.com	d9e1feadea	Add documentation for new parameters to example configuration files	2018-07-13 12:24:55 -07:00
Samuel	89e1f17738	Merge pull request #261 from mattkirby/fix_ignore_cert Change default vsphere connection behavior	2018-06-20 15:27:40 -05:00
kirby@puppetlabs.com	9bb4df7d8e	(POOLER-107) Add configuration API endpoint This commit adds a configuration endpoint to the vmpooler API. Pool size, and pool template, can be adjusted for pools that are configured at vmpooler application start time. Pool template changes trigger a pool refresh, and the new template has delta disks created automatically by vmpooler. Additionally, the capability to create template delta disks is added to the vsphere provider, and this is implemented to ensure that templates have delta disks created at application start time. The mechanism used to find template VM objects is simplified to make the flow of logic easier to understand. As an additional benefit, performance of this lookup is improved by using FindByInventoryPath. A table of contents is added to API.md to ease navigation. Without this change API.md has no table of contents and is difficult to navigate. Add mutex object for managing pool configuration updates This commit adds a mutex object for ensuring that pool configuration changes are synchronized across multiple running threads, removing the possibility of two threads attempting to update something at once, without relying on redis data. Without this change this is managed crudely by specifying in redis that a configuration update is taking place. This redis data is left so the REPOPULATE section of _check_pool can still identify when a configuration change is in progress, and prevent a pool from repopulating at that time. Add wake up event for pool template changes This commit adds a wake up event to detect pool template changes. Additionally, GET /config has a template_ready section added to the output for each pool, which makes clear when a pool is ready to populate itself.	2018-06-15 10:15:47 -07:00
kirby@puppetlabs.com	af8b73b6c9	Change default vsphere connection behavior This commit changes the vsphere connection behavior to set insecure false. Without this change insecure is always set to true when making a connection regardless of the setting provided with the provider configuration.	2018-06-08 12:40:03 -07:00
kirby@puppetlabs.com	82b9033e83	Document parameters added in vmpooler.yaml.example This change documents new vshpere specific parameters introduced related to host selection and folder creation. Without this change these paremeters are not documented.	2018-01-10 12:02:58 -08:00
kirby@puppetlabs.com	a3516d4913	Specify that migration_limit is vsphere only	2018-01-10 12:02:58 -08:00
Rob Braden	9b0e55f959	Merge pull request #227 from glennsarti/add-check-skew (GH-226) Use a dynamic pool_check loop period	2017-07-12 23:19:51 -07:00
Glenn Sarti	30946fab8e	(GH-226) Use a dynamic pool_check loop period Previously the check_pool would always check the pool every 5 seconds, however with a large number of pools, this can cause resource issues inside the providers. This commit: - Introduces a dynamic check_pool period which increases during stability and decreases when the pool is being change in an important way - Surfaces the settings as global config defaults but can also be set on a per pool basis - Adds defaults to emulate the current behaviour - Unit tests for the new behaviour	2017-07-12 17:13:21 -07:00
Glenn Sarti	7077eadc65	(POOLER-83) Add ability to specify a datacenter for vsphere Previously the vsphere provider assumed that there was one and only one datacenter (DC) in the vsphere instance. However this is simply not true for many vSphere installations. This commit: - Adds the ability to define a vSphere datacenter at the Pool or Provider level whereby the Pool setting takes precedence - If no datacenter is specified the default behaviour of picking the first DC in the vSphere instance - Updated all tests for the new setting - Update the vmpooler configuration file example with relevant setting name and expected behaviour - Fixed a bug in the rvmomi_helper whereby if no DC was found it would return all DCs. This is opposite behaviour of the real RBVMOMI library as it returns nil	2017-06-20 15:48:30 -07:00
Samuel Beaulieu	d93ab332f7	Adding support for multiple vsphere providers Refactoring the vmpooler.yaml format to support multiple providers. The second level key under :providers: is a unique key name that represents a provider that can be refered in the pool's parameter called provider. The code is still backward compatible to support the :vsphere: and :dummy: keys but in reality if you have more than one vsphere configuration you would give them a different name. For example :vsphere-pdx: and :vsphere-bfs: and the actual provider class would be specified as a parameter called 'provider_class'. See tests and examples for more information.	2017-06-09 21:44:03 +01:00
Glenn Sarti	2f37c1e9b5	(POOLER-52) Modify dummy provider to use a connection pool Previously a connection pooler class was added. This commit modifies the Dummy VM Provider to use a connection pooler. While the Dummy provider strictly speaking does not use connections, this allows testing to see what happens when connection pools are stressed or exhausted. This commit: - Modifies functions to use a connection pool object for the public API functions - Modifies the VMPooler YAML with new settings for connection pool size and timeout	2017-05-17 13:52:28 -07:00
Glenn Sarti	2e255a5a43	(POOLER-70) Move vSphere configuration into providers section Previously the vSphere based configuration was in the root of the configuration YAML. As there is deprecation support to move the old configuration to the new location, the vSphere provider can be updated. This commit updates the vSphere Provider and tests to use the new configuration location under: :providers: :vsphere:	2017-05-17 13:52:28 -07:00
Glenn Sarti	77afc86aeb	(POOLER-72) Add Dummy Provider Previously the only VM Provider was vSphere however this made testing and making changes difficult as it required a functioning vSphere instance. This commit adds a Dummy Provider which presents a VM provider to Pool Manager but manages provisioned "VM"s in a hashtable. The Dummy Provider can also be configured to randomly fail operations and take random amounts of time to perform operations, such as cloning a VM, which is useful to see how the Pool Manager copes with these events. This commit also updates the configuration YAML documentation and adds appropriate unit tests.	2017-05-17 13:52:27 -07:00
Glenn Sarti	2c74f4fa3a	(POOLER-71) Add dummy authentication provider Previously it was difficult to do local development as VMPooler requires an LDAP service for authentication. This commit adds a dummy authentication provider. The provider has passes authentication if the username and password are different, and fails if the username and password are the same. This commit also updates the documentation in the config YML file.	2017-02-09 16:23:10 -08:00
kirby@puppetlabs.com	86aedd0754	Add retry logic with a delay for vsphere connections This commit adds retry logic and configurable delays to vsphere helper. Without this change vmpooler instances that have large numbers of pools can create enough connections in a short period of time to cause vcenter issues.	2016-11-29 03:43:18 -08:00
kirby@puppetlabs.com	58a548bc90	Add support for migrating VMs to pool_manager. This commit adds a capability to pool_manager to migrate VMs placed in the migrating queue. When a VM is checked out an entry is created in vmpooler__migrating. The existing process for evaluating VM states executes the migrate_vm method for the provided VM, and removes it from the queue. The least used compatible host for the provided VM is selected and, if necessary, a migration to the lesser used host is performed. Migration time and time from the task being queued until completion are both tracked with the redis VM object in 'migration_time' and 'checkout_to_migration'. The migration time is logged in the vmpooler.log, or the VM is reported as not requiring migration. Without this change VMs are not evaluated for checkout at request time. Add a method to wrap find_vm and find_vm_heavy in order to allow a single operation to be performed that does both. This commit also adds support for a configuration setting called migration_limit that makes migration at checkout optional. Additionally, logging is added to report a VM parent host when it is checked out. Without this change vmpooler assumes that migration at checkout is always enabled. If this setting is not present, or if the setting is 0, then migration at checkout will be disabled. If the setting is greater than 0 then that setting will be used to enforce a limit for the number of simultaneous migrations that will be evaluated. Documentation of this configuration option is added to the vmpooler.yaml.example file.	2016-11-08 16:32:10 -08:00
Rick Sherman	34b93ca6c3	Merge CI.next into Master (#161 ) * [QENG-3919] Make vmpooler checkouts be all or nothing (#153) * (QENG-3919) spike for implementation of all-or-nothing checkout * Fix two botched variable references * Aggregate API helper methods * Add specs for failed multi-vm allocation API endpoints * (QENG-3919) Add tests for multiple vm requests * (QENG-3919) Add (failing) specs for POST /vm/pool1+pool2 usages This exposes the old (bad) behavior on this other code path. Will fix this up next. * (QENG-3919) Bring query params version in line with JSON post version Not clear to me why these had to be implemented so differently. * (QENG-3919) extract common method from both methods of VM allocation * (QENG-3919) Naming fix, cosmetic cleanups I mean, I presume all these commits are going to get squashed away on merge anyway. * (QENG-3919) Update API docs We consider it a bug that the actual behavior was not this behavior, but the documentation was also silent on this point. * (QENG-3919) minor readability tweak in refactored method * (QENG-3919) Clean up interim comments re: status codes * (QENG-3919) Drop now-orphaned `checkout_vm` method We kept this up-to-date while we were upgrading and refactoring, but, turns out, this method is no longer called anywhere. 💀 🔥 * (QENG-3919) Return 503 status on failed allocation Making sure we go back to the original functionality, which was: - status 200 when vms successfully allocated - status 404 when a pool name is unknown - status 404 when no pool name is specified - status 503 when vm allocation failed * (QENG-3919) add net-ldap to Gemfile Maybe we shouldn't foil-ball gems onto servers. * (QENG-3919) Turns out, spush isn't a redis command And hence we see once again the weakness of mockist tests. * (QENG-3919) Pin the net-ldap gem to 0.11 for the jrubies, etc. * (QENG-3919) Correct an old spelling error in spec descriptions * (QENG-3919) Further tweak net-ldap version * (QENG-3919) return_single_vm -> return_vm_to_ready_state cc @shermdog * (RE-7014) Add support for statsd They way we were using graphite was incorrect for the type of data we were sending it. statsd is the appropriate mechanism for our needs. statsd and graphite are mutually exclusive and configuring statsd will take precendence over Graphite. Example of configuration in vmpooler.yaml.example * (RE-7014) Add tracking of vm gets via statsd Add the tracking of successful, failed, invalid, and empty pool vm gets. It is possible we may want to tweak this, but have validated with spec tests and pcaps. ``` vmpooler-tmp-dev.ready.debian-7-x86_64:1\|c vmpooler-tmp-dev.running.debian-7-x86_64:1\|c vmpooler-tmp-dev.checkout.invalid:1\|c vmpooler-tmp-dev.checkout.success.debian-7-x86_64:1\|c vmpooler-tmp-dev.checkout.empty:1\|c vmpooler-tmp-dev.running.debian-7-x86_64:1\|c vmpooler-tmp-dev.clone.debian-7-x86_64:12.10\|ms vmpooler-tmp-dev.ready.debian-7-x86_64:1\|c ``` * (RE-7014) statsd nitpicks and additional rspec Cleaned up some code review nitpicks and added pool_manager_spec for empty pool. * (RE-7014) update statsd to use gauge for running/ready Previously was using increment which was incorrect for that particular application. * Revert "Merge pull request #155 from shermdog/RE-7014-cinext" This reverts commit `cc03a86f6a`, reversing changes made to `5aaab7c5c2`. * (QENG-4070) Consistently return 503 if valid pool is empty There were several problems with how the pooler checked out vms with respect to empty pools, invalid pools, and aliases: - If the vmpooler config did not contain any aliases and the caller requested a vm from an empty pool or a non-existent one, the vmpooler would error with: NoMethodError - undefined method `[]' for nil:NilClass If the config contained a non-nil alias section, then: - If the caller requested a vm from an empty pool and either the vm didn't have an alias or the aliased pool was empty or non-existent, then the request for that vm would be silently ignored. The vmpooler would return 200 if the caller asked for multiple vms and the vmpooler was able to checkout at least one vm. Otherwise it would return 404. - Similarly, if the caller requested a vm from a non-existent pool, then the request was silently ignored. This commit adds a `pool_names` Set to the config containing all valid pool names including aliases. This is used to determine whether a requested template name is valid or not. This is necessary because redis does not distinguish between empty and non-existent sets, e.g. the following returns false in both cases: backend.exists('vmpooler__ready__' + key) If the caller requests a vm (single or multiple), and any vm references an invalid pool name, we immediately return 404. Otherwise, we know the request is for valid pool names, since the vmpooler requires a restart to change pool names and counts. We then attempt to acquire each vm, trying to match on pool name or failing back to aliased pool name, as was the previous behavior. The resulting behavior is: - If the caller asks for at least one vm from an unknown pool, then don't try to checkout any vms and respond with 404. - If the caller asks for a vm, and at least one pool is empty, then respond with 503, returning checked out vms back to the pool. - Otherwise return 200 with the list of checked out vms. This commit also makes `alias` optional again. This commit also re-enables tests that were merged in from master, but originally commented out due to the bugs described above.. * (maint) Add json pessimistic pin json 2.0.x was released on July 1 and is not compatible with ruby < 2.0. Since we still support that version, add a pessimistic pin, which is what we were using prior to July 1. * [QENG-4070] Make json version conditional on RUBY_VERSION * Drop extraneous mocks from updated test * Revert "Revert "Merge pull request #155 from shermdog/RE-7014-cinext"" This reverts commit `0fd6fff934`. * Fix some spec errors These were caused in part by dropping changes from the original PR when we dropped the v1_spec.rb master test file (in favor of the updated and separated versions). * [QENG-4075] Fix bug with template name on allocation failure We're returning [nil,nil] in this case, meaning that name will not be set. This means we'll get an error trying to concatenate the stats string. Use the requested template name here instead. * [QENG-4075] Refactor statsd methods / classes Prior to this we could easily run into situations where `statds_prefix` would be `nil` (and possibly the `statsd` handle itself). There was some significant complexity and brittleness in how statsd was set up. Refactored so that: - `statsd_prefix` is no longer exposed to any callers of statsd methods - there is now a `Vmpooler::DummyStatsd` class which can be returned when we are not actually going to publish stats, but would like to keep the calling interface consistent - setup of the statsd handle is via just passing in `config[:statsd]`, if `nil`, this will result in a dummy handle being return - defaulting of `server` values was fixed -- this did not actually work in the previous implementation. `config[:statsd][:server]` is now required. - tests use a `DummyStatsd` instance instead of an rspec double. - calls to `statsd.increment` were taking incorrect arguments (some our fault, some part of the prior implementation), and were not collecting data on which pools were "invalid" or "empty". Fixed this and are now explicitly tracking the invalid/empty pool names. * [QENG-4075] Drop now-superfluous :statsd config defaulting * [QENG-4075] Unify graphite and statsd for the pool manager Prior to this, the `pool_manager.rb` library could take handles for both graphite and statsd endpoints (which were considered mutually exclusive), and then would use one. There was a bevy of conditional logic around sending metrics to the graphite/statsd handles (and actually at least one bug of omission). Here we refactor more, building on earlier work: - Our graphite class comes into line with the API of our Statsd and DummyStatsd classes - In `pool_manager.rb` we now accept a single "metrics" handle, and we drop all the conditional logic around statsd vs. graphite - We move the inconsistent error handling out of the calling classes and into our metrics classes, actually logging to `$stderr` when we can't publish metrics - We unify the setup code to use `config` to determine whether statsd, graphite, or a dummy metrics handle should be used, and make that happen. - Cleaned up some tests. We could probably stand to do a bit more work in this area. * [QENG-4075] Clean up pool manager, specs Prior to this, `pool_manager.rb` allowed the `metrics` argument to be optional, but at this point it will be an instance of `Vmpooler::Statsd`, 'Vmpooler::Graphite', or `Vmpooler::DummyStatsd`, so making this non-optional. Cleaned up that file's tests, cosmetically, as well as recognizing that the behavioral difference between graphite and statsd does not depend on the pool manager. * [QENG-4075] update example vmpooler.yaml file This documents the changes to :server being mandatory for all metrics endpoints, as well as the graphite endpoint supporting an optional :port configuration value. * [QENG-4075] Rename usages of statsd -> metrics Really, let's just support a generic metrics interface. * (maint) move statsd-ruby require into Vmpooler::Statsd class We've managed to move mentions of this out of the calling code, so let's move the require. * (maint) metrics.log -> metrics.timing We missed this during the refactoring. Bringing this up to date. * [QENG-4075] Allow specifying 'graphs:' for dashboard Prior to this the dashboard front-end would use the configuration settings for `graphite[:server]`/`graphite[:prefix]` to locate a graphite server to use for rendering graphs. Now that we have multiple possible metrics backends, the front-end graph host for the dashboard could be entirely different from the back-end metrics server that we publish to (if any). This decouples those settings: - use `graphs[:server]` / `graphs[:prefix]` for the graphite-compatible web front-end to use for dashboard display graphs - fall back to `graphite[:server]`/`graphite[:prefix]` if `graphs` is not specified, in order to support legacy `vmpooler.yaml` configurations. Note that since `statsd` takes precedence over `graphite`, it's possible to specify both `statsd` (for publishing) and `graphite` (for reading). We still prefer `graphs` over `graphite`. Updated the example `vmpooler.yaml` config file. * (maint) fix variable reference in new_metrics This was referencing config directly, when what we want is for a hash to be passed in (derived from config). * (maint) Fix typo in updated graph link call * (maint) default :graphs prefix to 'vmpooler' * (maint) Fix parse error in vmpooler script The things you find through manual QA 🧌 * (maint) use strings instead of symbols in config Nested hash data comes back with string keys, not symbols. Be consistent. * [QENG-4075] Factor out Vmpooler::DummyStatsd This makes it visible to lib/vmpooler.rb, as well as putting this dummy metrics endpoint in its own file for easier discovery. * (maint) clean up statsd inclusion and require lines The library is actually required as 'statsd' and not 'ruby-statsd', best I can tell. * (maint) construct ::Statsd instead of Statsd Because it's ambiguous in this scope, and, well, it doesn't actually work in production. * [QENG-4075] Also track completely invalid requests When we don't even get a pool name we still want metrics to be recorded.	2016-07-25 10:43:32 -05:00
FOXX	38a184c6f5	Made adjustments from colinPL	2016-01-14 12:27:46 -06:00
FOXX	10e507c262	Added prefix parameter to the vmpooler configuration	2016-01-14 11:42:12 -06:00
Scott Schneider	17b24d69ad	Allow pool 'alias' names The following pool configuration would allow a pool to be aliased in POST requests as 'centos-6-x86_64', 'centos-6-amd64', or 'centos-6-64': ````yaml - name: 'centos-6-x86_64' alias: [ 'centos-6-amd64', 'centos-6-64' ] template: 'templates/centos-6-x86_64' folder: 'vmpooler/centos-6-x86_64' datastore: 'instance1' size: 5 ```` The 'alias' configuration can be either a string or an array. Note that even when requesting an alias, the pool's 'name' is returned in the JSON response: ```` $ curl -d '{"centos-6-64":"1"}' --url vmpooler/api/v1/vm ```` ````json { "ok": true, "centos-6-x86_64": { "hostname": "cuna2qeahwlzji7" }, "domain": "company.com" } ````	2015-11-05 11:51:53 -08:00
Scott Schneider	825ade9e12	'clone_target' is overridable pool-level You know, like for a cluster of Mac Minis, maybe.	2015-08-20 22:25:42 -07:00
Scott Schneider	6523062b62	Allow for only a [configurable] tag set	2015-06-30 12:54:46 -07:00
Scott Schneider	1ba1e0fc9b	Add example for tagfilter	2015-06-02 10:53:20 -07:00
Scott Schneider	7ccd9433df	Token documentation and configuration examples	2015-04-28 19:27:21 -07:00
Scott Schneider	7b97e58e0f	Clean up Redis VM objects after redis['data_ttl']	2015-03-20 10:25:51 -07:00
Scott Schneider	b54d42aea4	Allow for a globally-configurable 'pending' timeout	2015-01-08 14:16:13 -08:00
Scott Schneider	e516ddc0b0	Add 'domain' config example	2014-08-29 11:20:32 -07:00
Scott Schneider	e8125e7702	Allow a 'clone_target' configuration setting If set, this setting will enable "fake DRS", cloning VMs onto the host in the specified cluster containing the least amount of VMs.	2014-07-28 10:51:45 -07:00
Scott Schneider	4f3c5323d8	Don't use VMware resource pools for organization This commit removes support for VMware 'resource pool' functionality entirely, as VMware installations without the DRS feature enabled are unable to configure or use resource pools. Also, resource pools should have never been used for organization; that's what folders are for.	2014-07-21 13:49:57 -07:00
Scott Schneider	3af680f860	Allow a configurable Graphite namespace (prefix)	2014-03-06 10:53:27 -08:00
Scott Schneider	565e03445e	Initial	2014-02-19 16:05:27 -08:00

36 commits